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ANTAGONISTS OF BMP AND TGFp SIGNALING PATHWAYS 

The research leading to this invention was supported in part by The 
National Institute of Health Grant No. SR01HD3242902. Thus, the United States 
Government has certain rights in the invention. 

5 BACKGROUND OF THE INVENTION 

In virtually all animal phyla, critical steps in embryonic development are 
regulated by cell-to-cell, or inductive, signals mediated by secreted growth factors. In 
particular, members of the transforming growth factor P (TGFp) superfamily regulate a 
myriad of cellular and developmental processes, such as mitosis, cell differentiation, 
10 embryonic pattern formation and organogenesis. In vertebrate embryos, a variety of TGFP 
signals affect germ layer specification, body patterning, cell growth and differentiation (1- 
4). In embryos of the amphibian Xenopus, distinct TGFp members induce different cell 
fates, e.g., activin, Vgl and nodal induce mesoderm characteristic of the dorsal part of the 
embryo, such as notochord and muscle. Vgl and activin also induce endoderm 
15 characteristics. In contrast, Bone Morphogenetic Proteins (BMPs) specify mesoderm, such 
as blood and mesenchyme, and regulate epidermal and neural cell differentiation in the 
ectoderm (see (5) for a review). 

Cells respond to ligands in the TGFp family by transducing signals directly 
from cell surface receptor complexes to nuclear DNA targets via the Smad family of 
20 proteins (see (4), (6), (7), and (52) for a review). 

Smads are related to Drosophila Mad (mothers against decapentaplegic 
[dpp]) and proteins encoded by three related nematode genes Sma 2, Sma 3, and Sma 4. 
The terms Sma and Mad have been fused as Smad to unify the nomenclature. There are 
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eight members in the Smad family. Phosphorylated Smads 1, 5 and 8 are functional 
mediators of BMP family signaling in partnership with Smad 4. Smads 2 and 3 are signal 
transducers for actions of TGFp and activins. Smad 6 and Smad 7 function as antagonists 
to inhibit TGFp/BMP superfamily signaling. Interestingly, Smad7 is localized in the 
5 nucleus and accumulates in the cytoplasm in response to TGFp signalling (73). 

Furthermore, expression of both Smad6 and Smad7 is regulated by TGFbs, BMPs, growth 
factors and cytokines thereby providing for negative feedback regulation of the Smad 
signalling pathway (53-58). Phosphorylated Smad 1 forms a heteromeric complex with 
Smad 4 when entering the nucleus and activates transcription of early response genes. The 
10 BMP receptors may also signal via the mitogen activated protein kinase. It is likely that 
BMPs regulate cell cycle progression and thus govern differentiation of mesenchymal stem 
cells. 

Signal transduction in two major pathways, BMP and activin/TGFp, have 
been described in detail. Two distinct receptor subunits, the Type I and II transmembrane 

1 5 serine/threonine kinases, form activated complexes upon ligand binding. In these 

complexes the Type II subunit activates the Type I subunit, which directly phosphorylates 
' and activates particular receptor-regulated R-Smad proteins: BMP receptors target Smadl 
and closely-related Smads 5 and 8, while activin and TGFp receptors target Smadl and 
closely-related Smads 2 and 3. Upon activation these R-Smads form a heteromeric 

20 complex with Smad4, the "common partner" Smad. This complex translocates to the 

nucleus, binds to promoters of target genes in cooperation with DNA binding proteins, and 
activates transcription by recruiting coactivators. A third class of inhibitory Smads 
(I-Smads), Smad6 and Smad7, function as inhibitors that block Smad-Smad complex 
formation or Smad-receptor interactions. I-Smads bind to the cytoplasmic domain of 

25 receptors or directly to Smadl . Mutations in components at all levels in this pathway are 
associated with embryonic defects and various cancers, underscoring the importance of 
this growth factor family in developmental and disease processes. {See, (4), (7) for 
reviews.) In particular, defects of Smad2 and Smad4 are associated with colon and lung 
cancer and defects in human Smad4 are associated with pancreatic cancer. 

30 Smads do not have intrinsic enzymatic activity. Thus, the nature of the 

cellular response to Smad signaling is exquisitely sensitive to the level of Smad protein in 
the cell. Indeed, alternative cell fate determinations in Xenopus embryos can be achieved 
by relatively small changes in the amount of Smad protein expressed in the cell (8-1 1). 
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Therefore, regulating the level of Smad protein in the cell can be used as one means of 
modulating morphogenetic signaling by the TGFp superfamily. 

Protein modification by covalent attachment of ubiquitin is recognized as a 
general signal to target proteins for degradation via the proteasome (see (12), and (13) for 
5 a review). Targets of selective ubiquitination include transcription factors, cell cycle 
regulators, signal transduction proteins, and membrane proteins (references in (12)). 
Selective ubiquitination and degradation of specific target proteins can function as an 
important mechanism to control cell cycle progression, programmed cell death, 
differentiation and embryonic development. Dysfunction of the ubiquitination pathway is 
10 associated with disease and abnormal development. Ubiquitin ligases are part of a 
multimeric complex that catalyzes the covalent attachment of ubiquitin, a 12.5 kD 
polypeptide, to target proteins. Attachment of ubiquitin to its target serves as a molecular 
"flag" that marks the ubiquitinated protein for proteolytic degradation via an organelle 
known as the 26S proteosome. There are at least three enzymes involved in conjugating 
15 ubiquitin to target proteins, namely, El, E2 and E3. The El enzyme activates a ubiquitin 
molecule and conjugates it to the E2 enzyme which then either directly attaches ubiquitin 
onto a target protein, or passes it to the E3 ubiquitin ligase. The E3 recognizes a particular 
substrate and directs it ubiquitination. 

A few examples of developmental regulation by the ubiquitination system 
20 have been described in Dictyostelium (14-16) and Drosophila (17-21). Conjugation of 
ubiquitin to receptors is used in diverse systems to control endocytosis and signalling, as 
well as receptor steady state levels by both proteasome- and lysosome-mediated 
degradation (59-60). Direct ubiquitination of membrane receptors has been characterized 
in a number of systems, although in some cases ubiquitin-dependent regulation does not 
25 appear to involve direct conjugation of ubiquitin to the receptor (61-62). Although many 
cell surface receptors are regulated by ubiquitin-dependent pathways, few E3 ubiquitin 
ligases that bind to membrane proteins and target them for ubiquitination have been 
defined. Nedd4, a C2-WW-HECT domain E3 ubiquitin ligase, can regulate the turnover 
of the amiloride-sensitive sodium channel by binding directly to a PPXY motif present in 
30 the carboxy-terminus of the channel (63-66). Furthermore, the RING finger protein, c-cbl 5 
has recently been shown to function as an E3 ubiquitin ligase that binds to the EGF 
receptor to mediate ubiquitination and downregulation of the receptor complex (67-68). In 
these examples, ubiquitination of the membrane proteins appears to involve direct 
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interactions between the E3 ligase and the target protein. Whether adaptor proteins might 
also function to recruit E3 ligases to specific receptor complexes is unknown. The 
mechanism and targets of ubiquitination in the control of patterning have heretofore 
remained elusive. 

5 References that are cited throughout the specification by number are listed 

at the end of the Example. All references cited herein are incorporated by reference. 

SUMMARY OF THE INVENTION 

The present invention advantageously provides a class of regulatory 
proteins that are involved in BMP and TGFp-mediated activation. In particular, these 
10 proteins regulate Smad proteins and/or promote degradation of TGFp receptor complexes 
in the presence of Smad proteins. By manipulating the activity of the proteins of the 
invention, the skilled artisan can up or down regulate cellular activation, e.g., via BMP or 
TGFp. 

Thus, in a first aspect, the invention provides an isolated Smurf protein, and 
1 5 particularly a human Smurf protein. In one embodiment, it is a Smurfl protein. In an 
alternative embodiment, it is a Smurf2 protein. In specific embodiments, exemplified 
infra, a human Smurfl has the amino acid sequence depicted in Figure 10 (SEQ ID NO:2), 
and a human Smurf2 has the amino acid sequence depicted in Figure 12 (SEQ ID NO:4). 
Smurf proteins of the invention may contain at least about 5 and preferably at least about 
20 10 contiguous amino acids from the sequences depicted in SEQ ID NO:2 and 4. 

The invention further provides an antibody that specifically binds to Smurf 

protein. 

The invention further provides a nucleic acid encoding the Smurf proteins 
of the invention. In specific embodiments, the nucleic acid has a nucleotide sequence as 
25 depicted in SEQ ID NO:l or SEQ ID NO:3. 

The invention further provides an oligonucleotide or nucleic acid that 
specifically hybridizes under highly stringent conditions to a nucleic acid having a 
sequence encoding Smurf, or the complementary sequence thereof. Such hybridizable 
nucleic acids include probes (i.e., they may be labeled), primers (e.g., for PCR 
30 amplification), anti-sense nucleic acids, ribozymes, and triple-helix forming nucleic acids. 

The invention further provides a vector comprising the nucleic acid 
encoding Smurf, e.g., under control of an expression control sequence. Also provided are 
host cells, harboring such a vector, and methods for producing Smurf by culturing such 
host cells under conditions that permit expression of Smurf protein from the vector. 
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Also contemplated is a transgenic non-human animal that expresses a 
human Smurf protein and non-human animals in which endogenous Smurf protein is 
deleted. 

The invention further provides a method for inhibiting a bone morphogenic 
5 protein or transforming growth factor-beta activation pathway in a cell. This method 
comprises permitting the cell to grow under conditions that permit expression of Smurf 
from a vector of introduced into the cell. Alternatively, the invention provides a method 
for promoting a bone morphogenic protein or transforming growth factor-beta activation 
pathway in a cell, which method comprises suppressing expression of endogenous Smurf 
10 in the cell. 

In addition, the present discoveries permit screening for a modulator of 
Smurf activity. Screens of the invention comprise detecting modulation of Smurf activity 
in the host cell in the presence of a test compound relative to Smurf activity of the host cell 
in the absence of the test compound. As shown in the examples, one such activity is 
1 5 ubiquitination of Smad proteins. Another activity is the enhancement of TGF(3 receptor 
degradation. 

DESCRIPTION OF THE DRAWINGS 
Figure 1. Smurf 1 encodes cm E3 abiquitin ligase. Protein sequence of 
Xenopus and human Smurfl compared to yeast (S. pombe) publ, given as SMURF1, 
20 hSMURFl and PUB1, respectively in the figure. Identical amino acids are shaded dark 
gray and conservative substitutions are shaded light gray. Based on primary structure 
Smurfl and publ are members of the Hect family of E3 ubiquitin ligases and display 
several conserved features of the family: A Iipid/Ca2+ binding domain is located at the 
N-terminus (residues 22-37), two WW protein interaction domains at 236 - 271 and 282 
25 -311 (indicated by thick lines) , and a catalytic Hect domain beginning at residue 347 and 
extending to the C-terminus. Alignment was by Clustal W analysis (MacVector). 

Figures 2A and 2B. Northern blots of mSmurfl expression in embryonic 
and adult mouse tissues . Equal amounts of PolyA+ mRNA from indicated stages and 
tissues were analyzed. (2A) Embryonic tissue - In this blot, embryonic days post coitum 
30 are indicated. (2B) Adult tissues shown are T, testes; K, kidney; M, skeletal muscle; L, 
lung; Sp, spleen; Br, brain; H, heart. 

Figures 3 A and 3B. Developmental expression of Xenopus Smurfl. 
(A) RT-PCR on staged embryonic cDNA revealed that Xenopus Smurfl is 
a maternal mRNA, present at highest levels in egg, blastula and early gastrula stages. 
35 Zygotic Smurfl mRNA levels decline at gastrulation but maintain steady expression into 
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swimming tadpole stages. Numbers above each lane correspond to Nieuwkoop and Faber 
stages: 7 and 9, blastula; 1 1 and 13, gastrula; 15 and 20, neurula; 25 and 35, tadpole. 
Ornithine decarboxylase (ODC) mRNA, ubiquitously expressed in cells, was assayed to 
normalize for RNA recovery. RT-PCR on mock cDNA (no reverse transcriptase). 
5 (B) Whole-mount in situ hybridization of Smurf 1 in developing Xenopus 

embryos. In egg and blastula stages Smurfl transcripts are localized to the animal pole 
half (bracket in egg). Expression is diffuse throughout the ectoderm and involuting 
marginal zone of the gastrula; views from , animal pole (an) and vegetal pole (Veg). There 
is some enrichment of transcripts in the neural folds at neurula stage 17. At tadpole stages 

10 25 and 35 Smurfl expression includes the brain (b), eye (e), otic vesicle (o), somites (s), 
pharyngeal pouches (p) and developing kidney (k). 

Figures 4A, 4B, 4C, and 4D. Smurfl leads to a selective decrease in 
steady-state level ofSmadl and Smad5 in mammalian cell lines. Cells were transiently 
transfected with the indicated expression vectors (DNA quantities in jig) and two days 

15 later an aliquot representing approximately 0.4% of total cell lysates were subjected to 
SDS-PAGE and immunoblotting. To determine the steady-state level of the Smads, the 
blots were probed with the appropriate Smad antibody as shown. Flag-hSmurfl 
expression level was confirmed by probing blots of total cell lysates with the anti-Flag 
monoclonal antibody. 

20 (A) COS-1 or 293T-cells were transfected with a constant amount of 

pCMV5-Smadl and increasing concentrations of pCMV5-Flag-hSmurfl as indicated. To 
determine Smadl steady-state level and the expression of hSmurfl, Western blots of total 
cell lysates from both cell lines were probed with a-Smadl and a-Flag antibodies 
(a-Smadl and a-Flag blot). 

25 (B) 293T cells were transiently transfected with pCMV5-Smadl, wild type 

or activated (Q203D) pCMV5-ALK6-HA, and increasing amounts of 
pCMV5-Flag-hSmurfl as indicated. Smadl steady-state levels were examined by 
immunoblotting total cell lysates in Western blots with a-Smadl antibody (a-Smadl). 
hSmurfl and wild type or activated ALK6 expression levels were determined by 

30 immunoblotting the total cell lysates with a-Flag (a-Flag) or a-HA (a-HA) antibody, 
respectively. 

(C) 293T cells were transfected with a constant amount of pCMV5-Smadl 
or pCMV5-Smad2 along with the indicated concentrations of pCMV5-Flag-hSmurfl . 
Smadl (a-Smadl blot), Smad2 (a-Smad2 blot) and Flag-hSmurfl (a-Flag blot) 
35 steady-state protein levels were determined in cell lysates, as above. 
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(D) 293T cells were transfected with Smadl, Smad3, Smad4 or Smad5 in 
pCMV5 in the absence or presence of pCMV5-Flag-hSmurfl. The steady-state level of 
the Smads in total cell lysates was determined by incubating the Western blots with 
a-Smadl antibody for Smadl and Smad5, a-Smad3 antibody for Smad3, and a-Smad4 
5 antibody for Smad4 detection (a-Smads blot). Equivalent hSmurfl expression was 
confirmed as described above (a-Flag blot). 

Figures 5A, 5B, and 5C. hSmurfl regulates Smadl turnover and 
ubiquitination: hSmurfl enhances Smadl turnover. 

(A) COS-1 cells were transiently transfected with pCMV5-Smadl alone or 
10 with Flag-tagged hSmurfl (F/hSmurfl) using lipofect AMINE. Two days later, 

transfectants were subjected to pulse-chase analysis using [ 35 S] methionine. At the 
indicated time during the chase, cells were lysed and subjected to an a-Smadl 
immunoprecipitation. The immunoprecipitates were resolved by SDS-PAGE and 
visualized by autoradiography (left panel). Radiolabeled Smadl was also quantitated by 
1 5 phosphoimaging, and the results are plotted as the amount of [ 35 S] methionine-labelled 
Smadl present at each time point relative to the level at time 0 (right panel). 

(B) Ubiquitination of Smadl in 293T cells. Cells were transiently 
transfected with indicated combinations of HA-tagged ubiquitin (HA-Ub), pCMV5-Smadl 
and either wild type (WT) or the ubiquitin ligase mutant (CA) of Flag-tagged hSmurfl . 

20 Two days post-transfection, lysates were subjected to an a-Smadl immunoprecipitation 
(a-Smadl IP) followed by SDS-PAGE and immunoblotting with an a-HA monoclonal 
antibody (a-HA blot). Protein bands displaying immunoreactivity to a-HA are marked by 
the square bracket. Expression of Smadl or Flag-hSmurfl was confirmed by subjecting 
total cell lysates to immunoblotting with an a-Smadl polyclonal antibody (a-Smadl blot) 

25 or an a-Flag monoclonal antibody (a-Flag blot), respectively. 

(C) Loss in Smadl steady-state level by hSmurfl requires an intact 
ubiquitin ligase activity of the Hect domain. 293T cells were transfected with Smadl and 
increasing amounts of the wild type (WT) or the ubiquitin ligase mutant (C710A). Total 
cell lysates were analyzed for Smadl, hSmurfl or hSmurfl (C71 OA) protein by 

30 immunoblotting total cell lysates with the appropriate antibody, as described in Fig 3. 

Figures 6 A, 6B, and 6C. Interaction ofSmurfl and Smads. 
(A) Smurfl interacts with Smadl. Yeast two-hybrid assays were done on 
yeast co-transformed with combinations of Xenopus Smurfl (Xsmurf) together with 
Smadl, Smad2, lamin or vector alone. Only the combination of Smadl and Smurfl 
35 exhibited significant p-galactosidase activity (left panel, photograph of stained yeast 

colonies assayed in duplicate). Co-immunoprecipitation on in vitro translated proteins was 
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done by incubating 35S-labelled Smurfl together with in vitro translated, 35S-Met 
trace-labelled Flag-tagged Smadl (35S-F/Smadl), Smad2 (35S-F/Smad2) or Smad4 
(35S-F/Smad4) immobilized on anti-Flag affinity gel matrices. After washing, bound 
proteins were eluted and analyzed by SDS-PAGE (right panel). 
5 (B) hSmurfl interacts selectively with both Smadl and SmadS. 293T cells 

were transiently transfected with the pCMV5 expression vectors containing Smadl (SI), 
SmadS (S5) or Smad2 either alone, with wild type (WT) or the ubiquitin ligase mutant 
(CA) Flag-tagged hSmurfl (F/hSmurfl). To examine Smadl or Smad5 interaction with 
hSmurfl, blots of a-Flag immunoprecipitates were probed with a-Smadl polyclonal 

10 antibody (a-Smadl blot). To examine Smad2 interaction with hSmurfl, blots of a-Flag 
immunoprecipitates were probed with a-Smad2 polyclonal antibody (a-Smad2 blot). The 
level of hSmurfl in the immunoprecipitates and Smadl, SmadS and Smad2 in the total cell 
lysates, are shown in the lower two panels. 

(C) Specificity of hSmurfl actions on Smadl . Nedd4, a ubiquitin ligase 

15 related to hSmurfl, does not interact with and does not reduce the steady-state level of 
Smadl . 293T cells were transiently co-transfected with Smadl and either Flag-tagged 
hSmurfl (WT or CA) or Nedd4, (WT) or ligase mutant (CS) as indicated. Smadl 
coimmunprecipitation with hSmurfl (left panels) was determined as described above. To 
determine Smadl interaction withNedd4. cell lysates were subjected to 

20 immunoprecipitation using a-Hect-Nedd4 polyclonal antibody (a-Nedd4 IP) followed by 
immunoblotting with a-Smadl polyclonal antibody (a-Smadl blot), right panels. To 
determine the level of expression of Nedd4 in the samples, the respective blots were 
reprobed with a-WW2 Nedd4 polyclonal antibody (a-Nedd4 blot). Smadl levels in total 
cell lysates of these assays are shown by western blot in the bottom panel. 

25 Figures 7A and 7B. Smurfl dorsalizes prospective ventral mesoderm and 

neural izes ectoderm. 

(A) Four cell Xenopus blastulae were injected with Xenopus Smurf mRNA 
in the marginal zone of two ventral blastomeres (50 pg Smurfl mRNA per cell). Tadpoles 
that developed from injected embryos formed ectopic, dorsal axial structures (lower left 

30 panel, arrows). Co-injection of 100 pg Smadl together with 50 pg of Smurfl mRNA 
rescued the ectopic axial structures in all cases (lower tadpole). At early gastrulation 
ventral marginal zone (VMZ) pieces were explanted from another set of wild-type 
embryos, or embryos injected in the VMZ at the four cell stage with Smurfl mRNA alone, 
or Smurfl co-injected with Smadl. The VMZ explants were excised at early gastrulation, 

35 as depicted, then cultured until control embryos reached mid-tadpole, stage 28, when total 
VMZ RNA was prepared then assayed by RT-PCR for the expression of 
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eiythrocyte-specific a-globin, muscle-specific actin, and general cellular mRNA, eFl-a, as 
a control for RNA recover}' (lower right panel). Control PCR reactions were done on total 
embryonic RNA with or without reverse-transcription. 

(B) Fertilized egg animal poles were injected with Smurfl and/or Smadl 
5 mRNA, or were not injected. Animal caps were explanted at blastula stage 8, cultured to 
mid-gastrula stage 1 1, then total RNA was prepared and assayed by RT-PCR to detect 
expression of the cement gland marker, XAG, and a general neural marker, NCAM. 
Controls were as above. 

Figures 8A, 8B, and 8C. Smurfl alters embryonic cell competence to 
10 respond to Smadl and Smad2. 

(A) Smurfl blocks ventral mesoderm induction by Smadl . A constant 
amount (1 ng) of Smadl mRNA was injected alone, or together with an increasing amount 
of Smurfl, into fertilized egg animal poles and ventral mesoderm induction in animal cap 
explants was assayed by RT-PCR with primers for the Xhox3 and Xcad3 homeodomain 

15 genes. In lanes from left to right, respectively, animal caps were not injected or injected 
with Smurfl mRNA at doses of 100, 0, 25, 50, 100, and 200 pg. 

(B) Smurfl enhances dorsal mesoderm induction by Smad2. A constant 
amount (50 pg) of Smad2 mRNA was injected alone, or together with an increasing 
amount of Smurfl, and dorsal mesoderm induction was monitored by expression of myoD, 

20 which marks muscle, and goosecoid, a homeodomain gene expressed in the most dorsal 
type of mesoderm, the Spemann Organizer. Note that goosecoid expression was triggered 
from undetectable levels, as the dose of Smurfl was increased. Animal caps were injected 
as in panel a. 

(C) Dose-response of animal caps to Smad2. Animal caps were injected 
25 with synthetic mRNAs for Smad2. At 50 pg Smad2 MyoD was induced and reached 

maximal levels at 100 pg or more injected Smad2. Goosecoid was induced at a minimum 
Smad2 dose of 250 pg. The level of goosecoid induced by 250 pg Smad2 alone was 
equivalent to the level of goosecoid induced by a combination of 50 pg of Smad2 and 100 
pg of Smurfl (panel b). 
30 In all panels the far right two lanes correspond to PCR on wildtype 

embryonic RNA, with (RT+) and without (RT-) reverse transcription, respectively. eFl-a 
controls in all panels were as in Fig. 7. 

Figure 9. cDNA sequence of human Smurfl [SEQ ID NO: 1]. 

Figure 10. Protein sequence of human Smurfl [SEQ ID NO:2]. 
35 Figure 11. cDNA sequence of human Smurfl [SEQ ID NO: 3]. 

Figure 12. Protein sequence of human Smurf2 [SEQ ID NO: 4], 
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Smurf2 is member of the HECT E3 ubiquitin-ligase family. The C2 
(overline), WW (shaded) and HECT (double overline) domains and the Cys716Ala 
mutation (boxed) are shown. 

Figure 13. Comparison ofSmnrfl and Smurfl proteins. 
5 Schematic representation of Smurfl and Smurf2. The degree of amino acid 

identity (%) between the C2, WW and HECT domains is shown. 

Figures 14A and 14B. Smurf2 is expressed in mouse tissues, 

(A) Smurf2 is expressed throughout mouse embryogenesis. A 1 kb 
XhoI/NotI fragment encompassing the 3'UTR of mouse Smurf2 was used to probe a 

10 mouse embryonic Northern Blot (Clontech). 

(B) Expression of Smurf2 in adult mouse tissues. A multiple tissue 
northern blot (Clontech) was probed with a fragment of mouse Smurf2 as in A. 

Figures 15A, 15B, 15C, 15D, 15E and 15F. Smurfl interacts with Smad7. 
(A and B) Expression of Smurf2 does not decrease steady-state levels of 
15 the Smads. 293T cells were transfected with Flag-tagged Smadl, Smad2, Smad4 or HA- 
tagged Smad7 either alone or together with the Myc-tagged Smurf2. Aliquots of total cell 
lysates were immunoblotted to detect expression of Smurf2 and the Smads (Fig. 15A) or 
were subjected to immunoprecipitation with anti-Myc antibody followed by anti-Flag or 
anti-HA immunoblotting to detect any coprecipitating Smads (Fig. 15B). The migration 
20 of the anti-Myc heavy chain (IgH) is marked. 

(C) Expression of Smurf2 does not alter Smad7 turnover. COS-1 cells, 
transfected with either Smad7-HA alone or together with Flag-Smurf2, were pulse-labelled 
with [ 35 S] -methionine and then chased for the indicated times in media containing 
unlabelled methionine. [ 35 S]-labelled Smad7-HA in anti-HA immunoprecipitates was 

25 quantified by phosphorimaging and the levels in control cells (squares) and Smurf2 
expressing cells (circles) was plotted relative to the amount present at time 0. Data 
represents the average of two experiments +/- SD. 

(D) In vitro interaction of bacterially-expressed Smurf2 and Smad7. 
Bacterially-produced His-Smad7-HA protein was incubated with Ni 2+ -NTA, GST and 

30 GST-Smurf2. Bound material was visualized by SDS-PAGE and immunoblotting with 
anti-HA antibody. Levels of GST proteins were determined by Coomassie staining 
(bottom panel). 

(E) The P Y motif in Smad7 is an important determinant for mediating 
interaction with Smurf2. 293T cells were transfected with Flag-Smurf2 either alone or 

35 together with wild type (WT) or mutant Y211A (YA) or APYversions of Smad7-HA. Cell 
lysates were subjected to anti-Flag immunoprecipitation and coprecipitating Smad7 
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proteins were detected by immunoblotting with anti-Smad7 antibody. Smad7 expression 
was confirmed by immunoblotting aliquots of total cell lysates (bottom panel). 

(F) The WW domains of Smurf2 are necessary for binding to Smad7. 293T 
cells were transfected with Smad7-HA and either wild type (WT) or mutant (AWW1, 
5 AWW2 or AWW3) versions of Flag-Smurf2. Cell lysates were subjected to anti-Flag 
immunoprecipitation, and coprecipitating Smad7 was detected by immunoblotting with 
anti-HA antibody. Smad7 expression was confirmed by immunoblotting aliquots of total 
cell lysates (bottom panel). 

Figure 16. Smad7 recruits Smurfl to the TGFp receptor complex. 

10 COS-1 cells were transfected with various combinations of TpRII, TpRI- 

HA, Smad7-HA and wild type (WT) or mutant (C716A) Flag-Smurf2 as indicated. Cells 
were affinity-labelled with [ 125 I]TGFPand lysates immunoprecipitated with anti-Flag or 
anti-Smad7 antibodies. Coprecipitating receptor complexes were visualized by SDS- 
PAGE and autoradiography. The amount of coprecipitating TpRI was quantified by 

15 phosphorimaging (right panels). Receptor expression was confirmed by visualizing 
aliquots of total cell lysates by autoradiography. Smurf2 and Smad7 levels were 
confirmed by immunoblotting aliquots of total cell lysates with anti-Flag and anti-HA 
antibodies, respectively. 

Figures 17 A, 17B, 17C, 17D and 17E. Smurfl, induces degradation of 

20 TGFfireceptors and Smad7. 

(A) Smurfl expression in the absence of Smadl does not decrease receptor 
steady-state levels. 293T cells were transfected with various combinations of TpRII-HA, 
TpRI-HA and varying amounts of Flag-Smurf2 (plasmid DNA in micrograms) is 
indicated. Expression levels of proteins were determined by immunoblotting aliquots of 

25 total cell lysates using the appropriate antibodies as shown. 

(B) Smurfl in the presence ofSmad7 causes a decrease in steady-state 
receptor levels. 293T cells were transfected with Smad7-HA, either TpRII-HA and TpRI- 
HA (left panels) or with a constitutively active type I receptor, TpRI-HA (T204D) (right 
panels) together with increasing amounts of wild type (WT) or mutant Flag-Smurf2 

30 (C716A). Steady-state levels of the receptors, Smad7 and Smurf2 were determined by 
anti-HA or anti-Flag immunoblotting as indicated. 

(C) Smurfl increases the turnover rate of the receptor complex. COS-1 
cells transfected with TGFpreceptors (TpRII-HA and TpRI-HA) alone or together with 
Smad7-HA, Flag-Smurf2 or both were pulse-labelled with [ 35 S] -methionine and then 

35 chased for the indicated times in media containing unlabelled methionine. Cell lysates 
were subjected to anti-HA immunoprecipitation and the amount of labelled receptors and 
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Smad7 was quantified by phosphorimaging and is plotted relative to the amount present at 
time 0. 

(D) Proteasome and lysosome inhibitors block Smurfl-induced degradation 
of the receptor complex. COS-1 cells transfected with TGFp receptors (TpRII-HA and 
5 TpRI-HA), Smad7-HA and Flag-Smurf2 were pulse-labelled with [ 35 S]-methionine and 
then chased either in absence of inhibitors or in the presence of 30mM lactacystin, or 0.4 
mM chloroquine for the indicated times. Cell lysates were subjected to anti-HA 
immunoprecipitation and receptor and Smad7 levels were visualized by SDS-PAGE and 
autoradiography. 

10 (E) Smurf2 induces the ubiquitination of Smad7 in the presence of the 

receptors. 293T cells were transfected with HA-tagged ubiquitin together with various 
combinations of Smad7, TpRII, TpRI-Flag, and wild type (WT) or mutant (C716A) Myc- 
SmurfZ as indicated. Cell lysates were subjected to a double immunoprecipitation with 
anti-Smad7 antibodies followed by immunoblotting with anti-HA antibodies. Protein 

1 5 expression in aliquots of total cell lysates was confirmed by immunblotting. 

Figures 18A, 18B, 18C and 18D. Association ofSmurfl with Smad7 
enhances the Smad7 inhibitory activity. 

(A) Smad7(Y21 lAj binds to TGFJ3 receptors but has a reduced ability to 
recruit Smurf2 to the receptor complex. COS-1 cells were transfected with TGFp 

20 receptors (TpRII and TpRJ-HA) and either wild type (WT) or mutant (Y21 1 A) Smad7/HA 
in the absence or presence of Flag-Smurf2(C716A). Cells were affinity-labelled with 
[ 125 I]TGFP and lysates immunoprecipitated with anti-Smad7 or anti-Flag antibodies. 
Coprecipitating receptor complexes were visualized by SDS-PAGE and autoradiography. 
Total receptor expression was determined by autoradiography and Smad7 and Smurf2 

25 protein levels were confirmed by anti-HA or anti-Flag immunoblotting of aliquots of total 
cell lysates. 

(B and C) Smad7(Y21 1A) is not as effective as wild type Smad7 in 
inhibiting TGFb-dependent activation of transcription. HepG2 cells were transfected with 
the 3TP-Lux reporter alone or together with varying concentrations of wild type (WT) or 
30 mutant (Y211A) Smad7-HA. In (B), 0.3 ng/ml of each Smad7 plasmid was used. Cells 
were incubated in the presence or absence of TGFp 1, and luciferase activity was 
normalized to b-galactosidase activity and is plotted as the mean +/- SD of triplicates from 
representative experiments. 

(D) A model for Smad7 and Smurf2-mediated degradation of the TGFfi 
35 receptor complex. Smad7 binds directly to Smurf2 and associates with the TGFp receptor 
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complex. Thus, Smad7 functions as an adaptor protein that mediates degradation of the 
TGFp receptor complex. 



DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to a family of genes encoding E3 ubiquitin 

5 ligases, called Smurf proteins, and includes full length, or naturally occurring forms, and 
any functionally active or antigenic fragments thereof, from any animal, particularly 
mammal or amphibian, and more particularly from a human source. In specific 
embodiments, the E3 ubiquitin ligases called Smurf 1 and SmurO are characterized. 

This invention is based, in part, on the surprising discovery that two novel 

10 members of the Hect family of ubiquitin ligases interact with Smads. These ligases have 
been named Smurf 1 and Smurf2. One of these, Smurf 1, specifically targets BMP 
pathway-specific Smads, thereby acting as an antagonist or negative regulator of BMP 
signaling. Smurf 1 directly interacts with Smads 1 and 5 and regulates their ubiquitination, 
turnover, and activity. In amphibian embryos, Smurf 1 inhibits endogenous BMP signals, 

15 resulting in altered pattern formation and cell fate in the mesoderm and ectoderm. Thus, 
the invention provides a unique regulatory protein link between the ubiquitination pathway 
and the control of cell fate, e.g., during embryonic development. The invention further 
provides a nucleic acid encoding Smurfl [SEQ ID NO: 1], in addition to the Smurf 1 
protein, [SEQ ID NO: 2], and mutant variants thereof. 

20 The other novel member of the Hect family of ubiquitin ligases is Smurf2. 

SmurE, a C2-WW-HECT domain E3 ubiquitin ligase is related to Smurfl. Smurf2 does 
not interact with Smadl, 2 or 4, nor does Smurf2 alter steady state levels of Smad 1, 2 or 
4. Smurf2, however, does interact with Smad 7, binding directly to a PPXY motif in 
Smad7. Smurf2 is involved with TGFP receptor degradation acting in partnership with 

25 Smad7 as an antagonist or negative regulator of TGFp signaling. Activation of TGFp 
signalling results in Smad7-dependent recruitment of Smurf2 to the TGFp receptor 
complex. In the absence of activated TGFp receptor complex, Smurf2 does not alter the 
steady state level and turnover of Smad7. Recruitment of Smurf2 to the TGFp receptor by 
Smad7 promotes the degradation of the Smad7- TGFp receptor complex by both 

30 proteasomal and lysosomal pathways. The studies described herein demonstrate that 

Smad7 functions as an adaptor protein that recruits Smurf2 to the TGFp receptor complex 
to promote its degradation and thereby down-regulate activated TGFp receptor complexes. 
Regulation of Smad 7 localization to the nucleus and interaction with Smurf2 may be used 
to control the inhibitory activity of the Smad 7-Smurf2 complex. 
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The invention further provides a nucleic acid encoding Smurf2 [SEQ ID 
NO: 3], in addition to the Smurf 2 protein [SEQ ID NO: 4] and variants thereof. 

E3 ubiquitin ligases display very selective substrate specificity, as is evident 
in the findings discussed herein. For example, Smurf 1 binds BMP pathway-specific 
5 Smads. Moreover, Smurfl is a unique signaling protein of the BMP pathway because it 
binds only to Smadl and Smad5 and has little affinity for Smad 2 (specific for TGFp and 
activin receptor pathways), and no affinity for Smad4 (common Smad signaling partner). 
As a result of this substrate specificity, Smurfl can effectively interfere or regulate 
biological responses to BMPs without consequence to activin pathways, i.e., effects on 

10 other TGFp signaling pathways are limited, or non-existent. 

BMPs/TGFp signalling pathway functions in tissue differentiation, 
morphogenesis, and cell growth control (e.g., (52)). As an antagonist to the signal 
transduction pathway mediated by the TGFp family, Smurfl will inhibit the BMP pathway 
in vivo or in vitro. As an integral component of regulatory system for degradation of 

1 5 TGFp receptors, Smurf2 will inhibit the TGFp signalling pathway. As a result, the Smurfs 
will block chondrogenesis, osteogenesis, blood differentiation, cartilage formation, neural 
tube patterning, retinal development, heart induction and morphogenesis, hair growth, 
tooth formation, gamete formation and a wide variety of tissue and organ formation 
processes, and hinder the regeneration, growth, maintenance, etc., of bone and other 

20 tissues that are dependent on the BMP pathway. 

In one embodiment, mutant forms of a Smurf protein or small molecule 
antagonists of Smurf, described infra, can be used to prevent ubiquitination of proteins, 
e.g., Smads or TGFp receptor, and therefore to preserve the signal transduction pathway 
mediated by BMPs. In addition, fragments of Smads can be generated that bind to either 

25 the Hect domain of a Smurf protein, which domain has the catalytic activity needed for 
ubiquitination of Smads, or to the WW domains that interact with the PPXY domains of 
Smads, thus precluding Smurfl from binding and ubiquitinating Smads. These fragments 
can also be used in screening assays for small molecule inhibitors of Smurf-Smad 
interaction, e.g., in an inhibition binding assay. Variants of a Smurf protein and fragments 

30 of Smads can be used to improve defective BMP and/or TGFp/activin signaling as a result 
of overexpressed or increased Smurf activity, which may contribute to a disease state. 

BMPs control bone differentiation and growth, and are already in clinical 
tests and applications related to bone growth and connective tissue repair. Smurf proteins 
represent a novel target for the discovery of drugs that can influence its function, thereby 

35 affecting cellular responses to BMPs, and thus having clinical applications. Therefore, a 
Smurf protein can be used for screening for various drugs and/or antibodies that can either 
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enhance the BMP pathway, or inhibit it by antagonizing or mimicking the activity of the 
protein, respectively. For example, since Smurfl is highly specific for binding Smadsl 
and 5, it can be used to screen for drugs that block or activate the BMP pathway, and 
selectively affect cellular responses to BMPs without consequence to other members of the 
5 TGFp superfamily. 

Smurf proteins operate within cells, at the level of Smad signal 
transduction, and therefore provide an alternative means to affect BMP and TGFp/activin 
signals. However, because Smurf proteins are intracellular proteins, any manipulations 
aimed at directly altering Smurf activity must operate intracellularly. Such manipulations 

10 include antisense and ribozyme technology, and intracellular antibody technology. 

Smurf may be delivered to cells in gene therapy regimens to block 
excessive signaling by particular growth factor pathways controlled by Smads, e.g., Smads 
that are targets for Smurfl or Smurf2. The examples herein show that simply increasing 
the expression levels of Smurfl in cells antagonizes the Smad signaling pathway. Thus 

1 5 overexpression of Smurfs by gene therapy may be used to correct clinical conditions that 
result from excessive Smad signaling. These may include, for example, hyperplasia of 
bone, tendon or cartilage tissues, or formation of other tissues, that are regulated by signals 
from receptors (such as BMP receptors) that utilize Smadl or SmadS for signal 
transduction. 

20 Smurf nucleic acids or partial sequences thereof (such as PCR probes) 

would be useful as molecular probes for identification of defective Smurf genes in the 
human genome, particularly where a mutation of a Smurf gene is found in association with 
a particular disease. Smurf proteins may be used as reagents for in vitro assays to identify 
proteins in cells that are targets for ubiquitination. Purified Smurfs may be reconstituted 

25 with purified ubiquitination enzymes (i.e. El and E2 components) and utilized in 
functional (ubiquitination) assays that are aimed at identifying novel target proteins 
introduced into the assays (as purified proteins or translated cDNAs of unknown identity). 

In accordance with the present invention there may be employed 
conventional molecular biology, microbiology, and recombinant DNA techniques within 

30 the skill of the art. Such techniques are explained fully in the literature. See, e.g., 
Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual, Second 
Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York 
(herein "Sambrook et aL, 1989"); DNA Cloning: A Practical Approach Volumes I and II 
(D.N. Glover ed. 1985); Oligonucleotide Synthesis (MJ. Gait ed. 1984); Nucleic Acid 

35 Hybridization [B.D. Hames & S.J. Higgins eds. (1985)]; Transcription And Translation 
[B.D. Hames & S.J. Higgins, eds. (1984)]; Animal Cell Culture [R.I. Freshney, ed. 
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(1986)]; Immobilized Cells And Enzymes [IRL Press, (1986)]; B. Perbal, ^ Practical 
Guide To Molecular Cloning (1984); F.M. Ausubel ^ a/, (eds.), Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. (1994). 

If appearing herein, the following terms shall have the definitions set out 

5 below. 

The term "about" or "approximately" means within 20%, preferably within 
10%, and more preferably within 5% of a given value or range. 

As used herein, the term "isolated" means that the referenced material is 
free of components found in the natural environment in which the material is normally 

10 found. In particular, isolated biological material is free of cellular components. In the 
case of nucleic acid molecules, an isolated nucleic acid includes a PCR product, an 
isolated mRNA, a cDNA, or a restriction fragment. In another embodiment, an isolated 
nucleic acid is preferably excised from the chromosome in which it may be found, and 
more preferably is no longer joined to non-regulatory, non-coding regions, or to other 

15 genes, located upstream or downstream of the gene contained by the isolated nucleic acid 
molecule when found in the chromosome. In yet another embodiment, the isolated nucleic 
acid lacks one or more introns. Isolated nucleic acid molecules can be inserted into 
plasmids, cosmids, artificial chromosomes, and the like. Thus, in a specific embodiment, 
a recombinant nucleic acid is an isolated nucleic acid. An isolated protein may be 

20 associated with other proteins or nucleic acids, or both, with which it associates in the cell, 
or with cellular membranes if it is a membrane-associated protein. An isolated organelle, 
cell, or tissue is removed from the anatomical site in which it is found in an organism. An 
isolated material may be, but need not be, purified. 

The term "purified" as used herein refers to material that has been isolated 

25 under conditions that reduce or eliminate unrelated materials, Le. 9 contaminants. For 
example, a purified protein is preferably substantially free of other proteins or nucleic 
acids with which it is associated in a cell; a purified nucleic acid molecule is preferably 
substantially free of proteins or other unrelated nucleic acid molecules with which it can be 
found within a cell. As used herein, the term "substantially free" is used operationally, in 

30 the context of analytical testing of the material. Preferably, purified material substantially 
free of contaminants is at least 50% pure; more preferably, at least 90% pure, and more 
preferably still at least 99% pure. Purity can be evaluated by chromatography, gel 
electrophoresis, immunoassay, composition analysis, biological assay, and other methods 
known in the art. 

35 A "gene" is used herein to refer to a portion of a DNA molecule that 

includes a polypeptide coding sequence operatively associated with expression control 
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sequences. In one embodiment, a gene can be a genomic or partial genomic sequence, in 
that it contains one or more introns. In another embodiment, the term gene refers to a 
cDNA molecule (i.e., the coding sequence lacking introns). 

A DNA "coding sequence" is a double-stranded DNA sequence which is 
5 transcribed and translated into a polypeptide in a cell in vitro or in vivo when placed under 
the control of appropriate regulatory sequences. The boundaries of the coding sequence 
are determined by a start codon at the 5' (amino) terminus and a translation stop codon at 
the 3' (carboxyl) terminus. A coding sequence can include, but is not limited to, 
prokaryotic sequences, cDNA from eukaryotic mRNA, genomic DNA sequences from 

10 eukaryotic (e.g., mammalian) DNA, and even synthetic DNA sequences. If the coding 
sequence is intended for expression in a eukaryotic cell, a polyadenylation signal and 
transcription termination sequence will usually be located 3' to the coding sequence. 

"Expression control sequences", e.g., transcriptional and translational 
control sequences, are regulatory sequences that flank a coding sequence, such as 

15 promoters, enhancers, suppressors, terminators, and the like, and that provide for the 

expression of a coding sequence in a host cell. In eukaryotic cells, polyadenylation signals 
are control sequences. On mRNA, a ribosome binding site is an expression control 
sequence. 

A "promoter sequence" is a DNA regulatory region capable of binding 

20 RNA polymerase in a cell and initiating transcription of a downstream (3' direction) 

coding sequence. For purposes of defining the present invention, the promoter sequence is 
bounded at its 3' terminus by the transcription initiation site and extends upstream (5' 
direction) to include the minimum number of bases or elements necessary to initiate 
transcription at levels detectable above background. Within the promoter sequence will be 

25 found a transcription initiation site (conveniently defined for example, by mapping with 
nuclease SI), as well as protein binding domains (consensus sequences) responsible for the 
binding of RNA polymerase. 

A coding sequence is "under the control" of transcriptional and translational 
control sequences in a cell when RNA polymerase transcribes the coding sequence into 

30 mRNA, which is then trans-RNA spliced (if it contains introns) and translated into the 
protein encoded by the coding sequence. 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, 
such as a cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid 
molecule can anneal to the other nucleic acid molecule under the appropriate conditions of 

35 temperature and solution ionic strength (see Sambrook et al 9 supra). The conditions of 
temperature and ionic strength determine the "stringency" of the hybridization. For 
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preliminary screening for homologous nucleic acids, low stringency hybridization 
conditions, corresponding to a T m of 55 °C, can be used, e.g., 5x SSC, 0.1% SDS, 0.25% 
milk, and no formamide; or 30% formamide, 5x SSC, 0.5% SDS). Moderate stringency 
hybridization conditions correspond to a higher T m , e.g., 40% formamide, with 5x or 6x 
5 SCC. High stringency hybridization conditions correspond to the highest T m , e.g., 50% 
formamide, 5x or 6x SCC. Hybridization requires that the two nucleic acids contain 
complementary sequences, although depending on the stringency of the hybridization, 
mismatches between bases are possible. The appropriate stringency for hybridizing 
nucleic acids depends on the length of the nucleic acids and the degree of 

10 complementation, variables well known in the art. The greater the degree of similarity or 
homology between two nucleotide sequences, the greater the value of T m for hybrids of 
nucleic acids having those sequences. The relative stability (corresponding to higher T m ) 
of nucleic acid hybridizations decreases in the following order: RNA:RNA, DNA:RNA, 
DNA:DNA. For hybrids of greater than 100 nucleotides in length, equations for 

15 calculating T m have been derived {see Sambrook et al, supra, 9.50-9.51). For 

hybridization with shorter nucleic acids, i.e.. oligonucleotides, the position of mismatches 
becomes more important, and the length of the oligonucleotide determines its specificity 
(see Sambrook et al. , supra, 1 1 .7-1 1 .8). A minimum length for a hybridizable nucleic acid 
is at least about 10 nucleotides; preferably at least about 15 nucleotides; and more 

20 preferably the length is at least about 20 nucleotides. 

In a specific embodiment, the term "standard hybridization conditions" 
refers to a T m of 55 °C, and utilizes conditions as set forth above. In a preferred 
embodiment, the T m is 60°C; in a more preferred embodiment, the T m is 65 °C. In a 
specific embodiment, "high stringency" refers to hybridization and/or washing conditions 

25 at 68°C in 0.2XSSC, at 42°C in 50% formamide, 4XSSC, or under conditions that afford 
levels of hybridization equivalent to those observed under either of these two conditions. 

A "vector" is a recombinant nucleic acid construct, such as plasmid, phage 
genome, virus genome, cosmid, or artificial chromosome to which another DNA segment 
may be attached. In a specific embodiment, the vector may bring about the replication of 

30 the attached segment, e.g., in the case of a cloning vector. A segment of DNA is inserted 
into a vector at specific restriction sites. The segment of DNA encodes a polypeptide of 
interest and the segment and restriction sites are designed to ensure insertion of the 
segment in the proper reading frame for transcription and translation. 

A cell has been "transfected" by exogenous or heterologous DNA when 

35 such DNA has been introduced inside the cell. A cell has been "transformed" by 

exogenous or heterologous DNA when the transfected DNA effects a phenotypic change. 
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Preferably, the transforming DNA should be integrated (covalently linked) into 
chromosomal DNA making up the genome of the cell. 

The term "heterologous" refers to a combination of elements not naturally 
occurring. For example, heterologous DNA refers to DNA not naturally located in the 
5 cell, or in a chromosomal site of the cell Preferably, the heterologous DNA includes a 
gene foreign to the cell. A heterologous expression regulatory element is such an element 
operatively associated with a different gene than the one it is operatively associated with in 
nature. In the context of the present invention, a Smurfl or SmurfZ gene is heterologous 
to the plasmid vector DNA in which it is inserted for cloning or expression, and it is 

10 heterologous to a non-human host cell in which it is expressed, e.g., a CHO cell. 

A "nucleic acid molecule" refers to the phosphate ester polymeric form of 
ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA molecules") or 
deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, or 
deoxycytidine; "DNA molecules"), or any phosphoester anologs thereof, such as 

15 phosphorothioates and thioesters, in either single stranded form, or a double-stranded 

helix. Double stranded DNA-DNA, DNA-RNA and RNA-RNA helices are possible. The 
term nucleic acid molecule, and in particular DNA or RNA molecule, refers only to the 
primary and secondary structure of the molecule, and does not limit it to any particular 
tertiary forms. Thus, this term includes double-stranded DNA found, inter alia, in linear 

20 or circular DNA molecules (e.g., restriction fragments), plasmids, and chromosomes. In 
discussing the structure of particular double-stranded DNA molecules, sequences may be 
described herein according to the normal convention of giving only the sequence in the 5' 
to 3' direction along the nontranscribed strand of DNA (i.e., the strand having a sequence 
homologous to the mRNA). A "recombinant DNA molecule" is a DNA molecule that has 

25 undergone a molecular biological manipulation. 

The invention provides antisense nucleic acids (including ribozymes), 
which may be used to inhibit expression of Smurfl , particularly to enhance the BMP 
pathway via Smadsl and 4. Thus, antisense nucleic acids corresponding to the Smurfl 
gene, or a fragment thereof, can be used to alter BMP pathways. The invention also 

30 provides antisense nucleic acids to inhibit expression of Smurf2, to enhance the TGFp 

signalling pathway. An "antisense nucleic acid" is a single stranded nucleic acid molecule 
which, upon hybridizing with complementary bases in an RNA or DNA molecule, inhibits 
the latter's role. If the RNA is a messenger RNA transcript, the antisense nucleic acid is a 
countertranscript or mRNA-interfering complementary nucleic acid. As presently used, 

35 "antisense" broadly includes RNA-RNA interactions, RNA-DNA interactions, ribozymes 
and RNase-H mediated arrest. Antisense nucleic acid molecules can be encoded by a 
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recombinant gene for expression in a cell (e.g., U.S. Patent No. 5,814,500; U.S. Patent No. 
5,81 1,234), or alternatively they can be prepared synthetically (e.g., U.S. Patent No. 
5,780,607). 

As used herein, the term "oligonucleotide" refers to a nucleic acid, 
5 generally of at least 10, preferably at least 15, and more preferably at least 20 nucleotides, 
that is hybridizable to a genomic DNA molecule, a cDNA molecule, or an mRNA 
molecule encoding a gene, mRNA, cDNA, or other nucleic acid of interest. 
Oligonucleotides can be labeled, e.g., with 32 P-nucleotides or nucleotides to which a label, 
such as biotin, has been covalently conjugated. In one embodiment, a labeled 

10 oligonucleotide can be used as a probe to detect the presence of a nucleic acid. In another 
embodiment, oligonucleotides (one or both of which may be labeled) can be used as PCR 
primers, either for cloning full length or a fragment of Smurf 1 or Smurf2, or to detect the 
presence of nucleic acids encoding Smurfl or SmurO. In a further embodiment, an 
oligonucleotide of the invention can form a triple helix with a Smurfl or Smurf? DNA 

15 molecule. In still another embodiment, a library of oligonucleotides arranged on a solid 
support, such as a silicon wafer or chip, can be used to detect various polymorphisms of 
interest. Generally, oligonucleotides are prepared synthetically, preferably on a nucleic 
acid synthesizer. Accordingly, oligonucleotides can be prepared with non-naturally 
occurring phosphoester analog bonds, such as thioester bonds, etc. 

20 Specific examples of synthetic oligonucleotides envisioned for this 

invention include oligonucleotides that contain phosphorothioates, phosphotriesters, 
methyl phosphonates, short chain alkyl, or cycloalkl intersugar linkages or short chain 
heteroatomic or heterocyclic intersugar linkages. Most preferred are those with CH 2 -NH- 
0-CH 2 , CH 2 -N(CH 3 )-0-CH 2 , CH 2 -0-N(CH 3 )-CH 2 , CH 2 -N(CH 3 )-N(CH 3 )-CH 2 and O- 

25 N(CH 3 )-CH 2 -CH 2 backbones (where phosphodiester is 0-P0 2 -0-CH 2 ). US Patent No. 
5,677,437 describes heteroaromatic olignucleoside linkages. Nitrogen linkers or groups 
containing nitrogen can also be used to prepare oligonucleotide mimics (U.S. Patents No. 
5,792,844 and No. 5,783,682). US Patent No. 5,637,684 describes phosphoramidate and 
phosphorothioamidate oligomeric compounds. Also envisioned are oligonucleotides 

30 having morpholino backbone structures (U.S. Pat. No. 5,034,506). In other embodiments, 
such as the peptide-nucleic acid (PNA) backbone, the phosphodiester backbone of the 
oligonucleotide may be replaced with a polyamide backbone, the bases being bound 
directly or indirectly to the aza nitrogen atoms of the polyamide backbone (Nielsen et al., 
Science 254:1497, 1991). Other synthetic oligonucleotides may contain substituted sugar 

35 moieties comprising one of the following at the T position: OH, SH, SCH 3 , F, OCN, 
0(CH 2 ) n NH 2 or Q(CH 2 ) n CH 3 where n is from 1 to about 10; C, to C 10 lower alkyl, 
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substituted lower alkyl, alkaryl or aralkyl; CI; Br; CN; CF 3 ; OCF 3 ; O-; S-, or N-alkyl; O-, 
S-, or N-alkenyl; SOCH 3 ; S0 2 CH 3 ; ON0 2 ;N0 2 ; N 3 ; NH 2 ; heterocycloalkyl; 
heterocycloalkaryl; aminoalkylamino; polyalkylamino; substitued silyl; a fluorescein 
moiety; an RNA cleaving group; a reporter group; an intercalator; a group for improving 
the pharmacokinetic properties of an oligonucleotide; or a group for improving the 
pharmacodynamic properties of an oligonucleotide, and other substituents having similar 
properties. Oligonucleotides may also have sugar mimetics such as cyclobutyls or other 
carbocyclics in place of the pentofuranosyl group. Nucleotide units having nucleosides 
other than adenosine, cytidine, guanosine, thymidine and uridine may be used, such as 
inosine. 

As used herein, the term "homologous" in all its grammatical forms and 
spelling variations refers to the relationship between proteins that possess a "common 
evolutionary origin," including proteins from superfamilies (e.g., the TGFp superfamily) 
and homologous proteins from different species (e.g., Smad (human), Mad (drosophila), 
etc.) (Reeck et al. 9 Cell 50:667, 1987). Such proteins, and their encoding genes, have 
sequence homology, as reflected by their high degree of sequence similarity. 

Accordingly, the term "sequence similarity" in all its grammatical forms 
refers to the degree of identity or correspondence between nucleic acid or amino acid 
sequences of proteins that may or may not share a common evolutionary origin (see Reeck 
et aL, supra). However, in common usage and in the instant application, the term 
"homologous," when modified with an adverb such as "highly," may refer to sequence 
similarity and may or may not relate to a common evolutionary origin. 

In a specific embodiment, two DNA sequences are "substantially 
homologous" or "substantially similar" when at least about 70-75%, and most preferably at 
least about 80-85% of the nucleotides match over the defined length of the DNA 
sequences. An example of such a sequence is an allelic or species variant of a Smurf gene 
of the invention. Sequences that are substantially homologous can be identified by 
comparing the sequences using standard software available in sequence data banks, or in a 
Southern hybridization experiment under, for example, stringent conditions as defined for 
that particular system. Defining appropriate hybridization conditions is within the skill of 
the art. See, e.g., Maniatis et a!., supra; DNA Cloning, Vols. I & II, supra; Nucleic Acid 
Hybridization, supra. 

Similarly, in a particular embodiment, two amino acid sequences are 
"substantially homologous" or "substantially similar" when greater than 70% of the amino 
acids are identical, or greater than about 90% are similar (functionally similar). Preferably, 
the similar or homologous sequences are identified by alignment using, for example, the 
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GCG (Genetics Computer Group, Program Manual for the GCG Package, Version 7, 
Madison, Wisconsin) pileup program, BLAST, and Clustal W analysis (Mac Vector). 
Sequence comparison algorithms can also be found at 
http://www.nwfsc.noaa.gov/bioinformatics.html. 
5 The term "corresponding to" is used herein to refer similar or homologous 

sequences, whether the exact position is identical or different from the molecule to which 
the similarity or homology is measured. A nucleic acid or amino acid sequence alignment 
may include spaces. Thus, the term "corresponding to" refers to the sequence similarity, 
and not the numbering of the amino acid residues or nucleotide bases. 

10 "Homologous recombination" refers to the insertion of a foreign DNA 

sequence of a vector in a chromosome. Preferably, the vector targets a specific 
chromosomal site for homologous recombination. For specific homologous 
recombination, the vector will contain sufficiently long regions of homology to sequences 
of the chromosome to allow complementary binding and incorporation of the vector into 

15 the chromosome. Longer regions of homology, and greater degrees of sequence similarity, 
may increase the efficiency of homologous recombination. 

A DNA "coding sequence" is a double-stranded DNA sequence which is 
transcribed and translated into a polypeptide in a cell in vitro or in vivo when placed under 
the control of appropriate regulatory sequences. The boundaries of the coding sequence 

20 are determined by a start codon at the 5' (amino) terminus and a translation stop codon at 
the 3' (carboxyl) terminus. A coding sequence can include, but is not limited to, 
prokaryotic sequences, cDNA from eukaryotic mRNA, genomic DNA sequences from 
eukaryotic (e.g., mammalian) DNA, and even synthetic DNA sequences. If the coding 
sequence is intended for expression in a eukaryotic cell, a polyadenylation signal and 

25 transcription termination sequence will usually be located 3' to the coding sequence. 

Transcriptional and translational control sequences are DNA regulatory 
sequences, such as promoters, enhancers, terminators, and the like, that provide for the 
expression of a coding sequence in a host cell. In eukaryotic cells, polyadenylation signals 
are control sequences. 

30 A "promoter sequence" is a DNA regulatory region capable of binding 

RNA polymerase in a cell and initiating transcription of a downstream (3' direction) 
coding sequence. For purposes of defining the present invention, the promoter sequence is 
bounded at its 3' terminus by the transcription initiation site and extends upstream (5' 
direction) to include the minimum number of bases or elements necessary to initiate 

35 transcription at levels detectable above background. Within the promoter sequence will be 
found a transcription initiation site (conveniently defined for example, by mapping with 
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nuclease SI), as well as protein binding domains (consensus sequences) responsible for the 
binding of RNA polymerase. 

A coding sequence is "under the control" of transcriptional and translational 
control sequences in a cell when RNA polymerase transcribes the coding sequence into 
5 mRNA, which is then trans-RNA spliced and translated into the protein encoded by the 
coding sequence. 

A nucleic acid coding a protein of the Smurf family, e.g., Smurfl or 
Smurf2, whether genomic DNA or cDNA, can be isolated from any source, particularly 
from a human or xenopus cDNA or genomic library. Methods for obtaining genes are well 

10 known in the art, as described above (see, e.g., Sambrook et al, 1989, supra). In a 
specific embodiment, the invention provides cDNA sequences for human Smurfl 
(hSmurfl) and Smurf2 (hSmur£2) genes [SEQ ID NO: 1 and SEQ ID NO: 3J. 

Accordingly, any animal cell can potentially serve as the nucleic acid 
source for the molecular cloning of a Smurf gene. The DNA may be obtained by standard 

15 procedures known in the art from cloned DNA (e.g., a DNA "library"), including EST 
libraries and cDNA libraries prepared from tissues with high level expression of the 
protein (e.g., a Xenopus Stage 9 (blastula) cDNA library-these are the cells that evidence 
the highest levels of expression of Smurfl). Other cell lines that may express Smurfl or 
Smurf2 are frog blastula and gastula ectoderm, mesoderm, and endoderm; mouse 

20 embryonic stem cells; and various mammalian cells, such as NIH3T3, PC 12, 293T, Hela, 
and COS. DNA encoding a Smurf protein can also be obtained by chemical synthesis, by 
cDNA cloning, or by the cloning of genomic DNA, or fragments thereof, purified from the 
desired cell (See, for example, Sambrook et al., 1989, supra: Glover, D.M. (ed.), 1985, 
DNA Cloning: A Practical Approach, MRL Press, Ltd., Oxford, U.K. Vol. I, II). Clones 

25 derived from genomic DNA may contain regulatory and intron DNA regions in addition to 
coding regions; clones derived from cDNA will not contain intron sequences. Whatever 
the source, the gene should be molecularly cloned into a suitable vector for propagation of 
the gene. 

Identification of a specific DNA fragment containing a desired Smurf gene 
30 can be accomplished in various ways known in the art. For example, a portion of a Smurf 
gene exemplified below can be purified and labeled to prepare a labeled probe, and the 
generated DNA may be screened by nucleic acid hybridization^ to the labeled probe 
(Benton and Davis, Science 196:180, 1977; Grunstein and Hogness, Proc. Natl. Acad. Sci. 
U.S.A. 72:3961, 1975). Those DNA fragments with substantial homology to the probe, 
35 such as an allelic variant from another individual, will hybridize. In a specific 
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embodiment, high stringency hybridization conditions are used to identify a homologous 
Smurfl or Smurfi gene. 

The present invention also relates to cloning vectors containing genes 
encoding analogs and derivatives of a Smurf gene of the invention, e.g., Smurfl or 
5 SmurO, that have similar or homologous functional activity. The production and use of 
derivatives and analogs related to Smurfl and Smurf2 are within the scope of the present 
invention. For example, a truncated form of Smurf 1 or Smurf2 can be provided. Such a 
truncated form includes Smurfl or Smurf2 with a deletion. In a specific embodiment, the 
derivative or analog is functionally active, i.e., capable of exhibiting one or more 

10 functional activities associated with a full-length, wild-type Smurfl or Smurf2 of the 
invention. Such functions include binding to a Smad protein, e.g., Smadl, Smad5 or 
Smad7, and catalyzing the ubiquitination of the bound Smad, of the activated TGFp 
receptor/Smad complex, or another protein in a TGFp/activin pathway. In another 
embodiment, the fragment has binding affinity but lacks or has reduced catalytic capacity. 

15 A Smurf derivative can be made by altering encoding nucleic acid 

sequences by substitutions, additions or deletions that provide for functionally equivalent 
molecules. Alternatively, non-functional mutant forms of the Smurf proteins, that may for 
example compete with the wild-type Smurf protein in the BMP pathway, but which are 
less effective in ubiquitination of Smads, can be prepared for use in treating disorders 

20 associated with compromised BMP signaling pathways as described above. In a specific 
embodiment, infra, the mutation is C710A, described further in the Examples. In another 
specific embodiment, with respect to Smurf 2, the mutation is C716A. 

Further selection can be carried out on the basis of the properties of the 
gene, e.g., if the gene encodes a protein product having the isoelectric, electrophoretic, 

25 amino acid composition, partial or complete amino acid sequence, antibody binding 
activity, or ligand binding profile of a Smurf protein as disclosed herein. Thus, the 
presence of the gene may be detected by assays based on the physical, chemical, 
immunological, or functional properties of its expressed product. For example, cDNA 
clones, or DNA clones which hybrid-select the proper mRNAs, can be selected which 

30 produce a protein that, e.g., has similar or identical electrophoretic migration, isoelectric 
focusing or non-equilibrium pH gel electrophores is behavior, proteolytic digestion maps, 
or antigenic properties as determined for Smurfl, or Smurf2. 

The present invention also relates to analogs, and derivatives of a Smurf 
protein, homologs from other species, and mutant variants, which have the same or a 

35 homologous functional activity. The production and use of derivatives, analogs, and 
mutant variants related to a Smurf protein are within the scope of this invention. In a 
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specific embodiment, the derivative or analog is functionally active, i.e., capable of 
exhibiting one or more functional activities associated with a full-length, wild-type Smurf 1 
or Smurf2 protein of the invention. 

Smurf derivatives can be made by altering nucleic acid sequences by 
5 substitutions, additions or deletions that provide for functionally equivalent molecules. 
Preferably, derivatives are made that have enhanced functional activity relative to a native 
Smurf e.g., Smurfl or SmurfZ, or that lack a functional activity, such as catalytic activity 
in the Hect ubiquitin-ligase domain in the C-terminal portion of the Smurf proteins. In one 
specific embodiment, mutant hSmurfl having a point mutant at C710A is provided which 

10 disrupts the catalytic activity of the Hect domain thereby precluding ubiquitination of 

Smadsl and 4. In another specific embodiment, mutant hSmurf2 having a point mutation 
at C716A is provided which disrupts the catalytic activity of the Hect domain, precluding 
proteolytic degradation of the TGFp receptor-Smad 7 complex. Alternatively, Smurf 
protein derivatives may encode soluble fragments of a Smurf protein domain, e.g., WW 

15 domain, that have the same or greater affinity for the natural ligands, e.g. Smadsl, 5 or 7. 
Such soluble derivatives may be potent inhibitors of ligand (i.e., Smadsl, 5 or 7) binding 
to the Smurf proteins. 

In another specific embodiment, derivatives or fragments of Smadsl and 4 
can be made that bind Smurfl and preclude the E3 ligase from further binding cellular 

20 Smad and preventing its ubiquitination. In another embodiment, derivatives or fragments 
of Smad 7 can be made that reduce the association of Smurf2 to both Smad 7 and the 
TGFp receptor. Thus, in one embodiment the invention contemplates use of peptides 
containing the linker region of R-Smads having a PPXY sequence, which is a conserved 
motif recognized by WW domains (see, Rotin, Curr. Topics Microbiol. Immunol., 

25 228:1 15-133, 1998 and (33)) such as those found in Smurfl or Smurf2, and corresponding 
nucleic acid sequences. 

Due to the degeneracy of nucleotide coding sequences, other DNA 
sequences which encode substantially the same amino acid sequence as a Smurf gene may 
be used in the practice of the present invention. These include but are not limited to allelic 

30 genes, homologous genes from other species, and nucleotide sequences comprising all or 
portions of Smurf genes which are altered by the substitution of different codons that 
encode the same amino acid residue within the sequence, thus producing a silent change. 
Likewise, Smurf derivatives of the invention include, but are not limited to, those 
containing, as a primary amino acid sequence, all or part of the amino acid sequence of a 

35 Smurf protein, including altered sequences in which functionally equivalent amino acid 
residues are substituted for residues within the sequence resulting in a conservative amino 
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acid substitution. For example, one or more amino acid residues within the sequence can 
be substituted by another amino acid of a similar polarity, which acts as a functional 
equivalent, resulting in a silent alteration. Substitutes for an amino acid within the 
sequence may be selected from other members of the class to which the amino acid 
5 belongs. For example, the nonpolar (hydrophobic) amino acids include alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan and methionine. Amino acids 
containing aromatic ring structures are phenylalanine, tryptophan, and tyrosine. The polar 
neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine. The positively charged (basic) amino acids include arginine, lysine and 
10 histidine. The negatively charged (acidic) amino acids include aspartic acid and glutamic 
acid. Such alterations will not be expected to affect apparent molecular weight as 
determined by polyacrylamide gel electrophoresis, or isoelectric point. Particularly 
preferred substitutions include: 

- Lys for Arg and vice versa such that a positive charge may be maintained; 
1 5 - Glu for Asp and vice versa such that a negative charge may be 

maintained; 

- Ser for Thr such that a free -OH can be maintained; and 

- Gin for Asn such that a free CONH 2 can be maintained. 

Amino acid substitutions may also be introduced to substitute an amino 

20 acid with a particularly preferable property. For example, a Cys may be introduced a 
potential site for disulfide bridges with another Cys. A His may be introduced as a 
particularly "catalytic" site (i.e., His can act as an acid or base and is the most common 
amino acid in biochemical catalysis). Pro may be introduced because of its particularly 
planar structure, which induces b-turns in the protein's structure. 

25 The genes encoding Smurf derivatives and analogs of the invention can be 

produced by various methods known in the art. The manipulations which result in their 
production can occur at the gene or protein level. For example, a cloned Smurf 1 or 
Smurf2 gene sequence can be modified by any of numerous strategies known in the art 
(Sambrook et al, 1989, supra). The sequence can be cleaved at appropriate sites with 

30 restriction endonuclease(s), followed by further enzymatic modification if desired, 

isolated, and ligated in vitro. In the production of the gene encoding a derivative or analog 
of a Smurf, care should be taken to ensure that the modified gene remains within the same 
translational reading frame as the gene, uninterrupted by translational stop signals, in the 
gene region where the desired activity is encoded. 

35 Additionally, a Smurf-encoding nucleic acid sequence can be mutated in 

vitro or in vivo, to create and/or destroy translation, initiation, and/or termination 
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sequences, or to create variations in coding regions and/or form new restriction 
endonuclease sites or destroy preexisting ones, to facilitate further in vitro modification. 
Preferably, such mutations enhance the functional activity of the mutated Smurf gene 
product. Any technique for mutagenesis known in the art can be used, including but not 
5 limited to, in vitro site-directed mutagenesis (Hutchinson, C, et aL 9 1978, J. Biol. Chem. 
253:6551; Zoller and Smith, 1984, DNA 3:479-488; Oliphant et al, 1986, Gene 44:177; 
Hutchinson et al, 1986, Proc. Natl. Acad. Sci. U.S.A. 83:710), use of TAB** linkers 
(Pharmacia), etc. PCR techniques are preferred for site directed mutagenesis (see Higuchi, 
1989, "Using PCR to Engineer DNA", in PCR Technology: Principles and Applications 

10 for DNA Amplification, H. Erlich, ed., Stockton Press, Chapter 6, pp. 61-70). 

The identified and isolated gene can then be inserted into an appropriate 
cloning vector. A large number of vector-host systems known in the art may be used. 
Possible vectors include, but are not limited to ; plasmids or modified viruses, but the 
vector system must be compatible with the host cell used. Examples of vectors include, 

1 5 but are not limited to, E. coli, bacteriophages such as lambda derivatives, or plasmids such 
as pBR322 derivatives or pUC plasmid derivatives, e.g., pGEX vectors, pMal-c, pFLAG, 
pGBT9, etc. The insertion into a cloning vector can, for example, be accomplished by 
ligating the DNA fragment into a cloning vector which has complementary cohesive 
termini. However, if the complementary restriction sites used to fragment the DNA are 

20 not present in the cloning vector, the ends of the DNA molecules may be enzymatically 
modified. Alternatively, any site desired may be produced by ligating nucleotide 
sequences (linkers) onto the DNA termini; these ligated linkers may comprise specific 
chemically synthesized oligonucleotides encoding restriction endonuclease recognition 
sequences. Recombinant molecules can be introduced into host cells via transformation, 

25 transfection, infection, electroporation, etc., so that many copies of the gene sequence are 
generated. Preferably, the cloned gene is contained on a shuttle vector plasmid, which 
provides for expansion in a cloning cell, e.g., E. coli, and facile purification for subsequent 
insertion into an appropriate expression cell line, if such is desired. For example, a shuttle 
vector, which is a vector that can replicate in more than one type of organism, can be 

30 prepared for replication in both E. coli and Saccharomyces cerevisiae by linking sequences 
from an E. coli plasmid with sequences from the yeast 2\x plasmid. 



Expression of Smurf Polypeptides 

The nucleotide sequence coding for a Smurf protein, or antigenic fragment, 
derivative or analog thereof, or a functionally active derivative, including a chimeric 
35 protein, thereof, can be inserted into an appropriate expression vector, i.e., a vector which 
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contains the necessary elements for the transcription and translation of the inserted protein- 
coding sequence. Such elements are termed herein a "promoter." Thus, the nucleic acid 
encoding a Smurf protein of the invention is operationally associated with a promoter in 
an expression vector of the invention. Both cDNA and genomic sequences can be cloned 
5 and expressed under control of such regulatory sequences. An expression vector also 
preferably includes a replication origin. 

The necessary transcriptional and translational signals can be provided on a 
recombinant expression vector, or they may be supplied by the native gene encoding a 
Smurf protein and/or its flanking regions. 
10 Potential host-vector systems include but are not limited to mammalian cell 

systems infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems 
infected with virus (e.g., baculovirus); microorganisms such as yeast containing yeast 
vectors; or bacteria transformed with bacteriophage, DNA, plasmid DNA, or cosmid 
DNA. The expression elements of vectors vary in their strengths and specificities. 
1 5 Depending on the host-vector system utilized, any one of a number of suitable 
transcription and translation elements may be used. The host cell containing the 
recombinant vector comprising the nucleic acid encoding a Smurf protein of the invention 
is cultured in an appropriate cell culture medium under conditions that provide for 
expression by the cell. Useful host cells for expression of Smurf include C 2 C 12 , 293T, 
20 CHO, COS, HEK, Hela, HepG2, NIH3T3, PC12, P19 and other cell lines, and kidney, 
brain, and bone cells. 

A recombinant Smurf protein of the invention, or functional fragment, 
derivative, chimeric construct, or analog thereof, may be expressed chromosomally, after 
integration of the coding sequence by recombination. In this regard, any of a number of 
25 amplification systems may be used to achieve high levels of stable gene expression (See 
Sambrook et al., 1989, supra). 

Any of the methods previously described for the insertion of DNA 
fragments into a cloning vector may be used to construct expression vectors containing a 
gene consisting of appropriate transcriptional/translational control signals and the protein 
30 coding sequences. These methods may include in vitro recombinant DNA and synthetic 
techniques and in vivo recombination (genetic recombination). 

Expression of a Smurf protein may be controlled by any promoter/enhancer 
element known in the art, but these regulatory elements must be functional in the host 
selected for expression. Promoters which may be used to control Smurf gene expression 
35 include, but are not limited to, the SV40 early promoter region (Benoist and Chambon, 
1981, Nature 290:304-310), the promoter contained in the 3' long terminal repeat of Rous 
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sarcoma virus (Yamamoto, et aL, 1980, Cell 22:787-797), the herpes thymidine kinase 
promoter (Wagner et aL, 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the 
regulatory sequences of the metallothionein gene (Brinster et aL, 1982, Nature 296:39-42); 
prokaryotic expression vectors such as the p-lactamase promoter (Villa-Kamaroff, et aL, 
5 1978, Proc. Natl. Acad. Sci. U.S.A. 75:3727-3731), or the tac promoter (DeBoer, et aL, 

1983, Proc. Natl. Acad. Sci. U.S.A. 80:21-25); see also "Useful proteins from recombinant 
bacteria" in Scientific American, 1980, 242:74-94; promoter elements from yeast or other 
fungi such as the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK 
(phosphoglycerol kinase) promoter, alkaline phosphatase promoter; and the animal 

10 transcriptional control regions, which exhibit tissue specificity and have been utilized in 
transgenic animals: elastase I gene control region which is active in pancreatic acinar cells 
(Swift et aL, 1984, Cell 38:639-646; Ornitz et aL, 1986, Cold Spring Harbor Symp. Quant. 
Biol. 50:399-409; MacDonald, 1987, Hepatology 7:425-515); insulin gene control region 
which is active in pancreatic beta cells (Hanahan, 1985, Nature 315:1 15-122), 

15 immunoglobulin gene control region which is active in lymphoid cells (Grosschedl et aL, 

1984, Cell 38:647-658; Adames et aL, 1985, Nature 318:533-538; Alexander et aL, 1987, 
Mol. Cell. Biol. 7:1436-1444), mouse mammary tumor virus control region which is active 
in testicular, breast, lymphoid and mast cells (Leder et aL, 1986, Cell 45:485-495), 
albumin gene control region which is active in liver (Pinkert et aL, 1987, Genes and Devel. 

20 1:268-276), alpha-fetoprotein gene control region which is active in liver (Krumlauf et aL, 

1985, Mol. Cell. Biol. 5:1639-1648; Hammer et aL, 1987, Science 235:53-58), alpha 1- 
antitrypsin gene control region which is active in the liver (Kelsey et aL, 1987, Genes and 
Devel. 1:161-171), beta-globin gene control region which is active in myeloid cells 
(Mogram et aL, 1985, Nature 315:338-340; Kollias et aL, 1986, Cell 46:89-94), myelin 

25 basic protein gene control region which is active in oligodendrocyte cells in the brain 

(Readhead et aL, 1987, Cell 48:703-712), myosin light chain-2 gene control region which 
is active in skeletal muscle (Sani, 1985, Nature 314:283-286), and gonadotropic releasing 
hormone gene control region which is active in the hypothalamus (Mason et aL, 1986, 
Science 234:1372-1378). 

30 A wide variety of host/expression vector combinations may be employed in 

expressing the DNA sequences of this invention. Useful expression vectors, for example, 
may consist of segments of chromosomal, non-chromosomal and synthetic DNA 
sequences. Suitable vectors include derivatives of SV40 and known bacterial plasmids, 
e.g., E. coli plasmids col El, pCRl, pBR322, pMal-C2, pET, pGEX (Smith et aL, 1988, 

35 Gene 67:31-40), pMB9 and their derivatives, plasmids such as RP4; phage DNAS, e.g., 
the numerous derivatives of phage 1, e.g., NM989, and other phage DNA, e.g., M13 and 
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filamentous single stranded phage DNA; yeast plasmids such as the 2m plasmid or 
derivatives thereof; vectors useful in eukaryotic cells, such as vectors useful in insect or 
mammalian cells; vectors derived from combinations of plasmids and phage DNAs, such 
as plasmids that have been modified to employ phage DNA or other expression control 
5 sequences, and the like. 

For example, in a baculovirus expression systems, both non-fusion transfer 
vectors, such as but not limited to pVL941 (BamHl cloning site; Summers), pVL1393 
(BamHl , Smal, Xbal, EcoRl , Noil, XmallL Bglll, and Pstl cloning site; Invitrogen), 
pVL1392 (Bglll, Pstl, Notl, Xmalll, EcoRl, Xbal, Smal, and BamHl cloning site; 

10 Summers and Invitrogen), and pBlue5<r/cIII (BamHl , Bglll, Pstl, Ncol, and Hindlll cloning 
site, with blue/white recombinant screening possible; Invitrogen), and fusion transfer 
vectors, such as but not limited to pAc700 (BamHl and Kpnl cloning site, in which the 
BamHl recognition site begins with the initiation codon; Summers), pAc701 and pAc702 
(same as pAc700, with different reading frames), pAc360 (BamHl cloning site 36 base 

15 pairs downstream of a polyhedrin initiation codon; Invitrogen(195)), and pBlueBacHisA, 
B, C (three different reading frames, with BamHl, Bglll, Pstl, Ncol, and Hindlll cloning 
site, an N-terminal peptide for ProBond purification, and blue/white recombinant 
screening of plaques; Invitrogen (220)) can be used. 

Mammalian expression vectors contemplated for use in the invention 

20 include vectors with inducible promoters, such as the dihydrofolate reductase (DHFR) 
promoter, e.g., any expression vector with a DHFR expression vector, or a 
D//F7?/methotrexate co-amplification vector, such as pED (Pstl, Sail, Sbal, Smal, and 
EcoRl cloning site, with the vector expressing both the cloned gene and DHFR; see 
Kaufman, Current Protocols in Molecular Biology, 16.12 (1991)). Alternatively, a 

25 glutamine synthetase/methionine sulfoximine co-amplification vector, such as pEE14 
(Hindlll, Xbal, Smal, Sbal, EcoRl, and Bell cloning site, in which the vector expresses 
glutamine synthase and the cloned gene; Celltech). In another embodiment, a vector that 
directs episomal expression under control of Epstein Barr Virus (EBV) can be used, such 
as pREP4 (BamHl, Sfil, Xhol, Notl, Nhel, Hindlll, Nhel, Pvull, and Kpnl cloning site, 

30 constitutive RSV-LTR promoter, hygromycin selectable marker; Invitrogen), pCEP4 

(BamHl, Sfil, Xhol, Notl, Nhel, Hindlll, Nhel, Pvull, and Kpnl cloning site, constitutive 
hCMV immediate early gene, hygromycin selectable marker; Invitrogen), pMEP4 (Kpnl, 
Pvul, Nhel, Hindlll, Notl, Xhol, Sfil, BamHl cloning site, inducible methallothionein Ila 
gene promoter, hygromycin selectable marker: Invitrogen), pREP8 (BamHl, Xhol, Notl, 

35 Hindlll, Nhel, and Kpnl cloning site, RSV-LTR promoter, histidinol selectable marker; 
Invitrogen), pREP9 (Kpnl, Nhel, Hindlll, Notl, Xhol, Sfil, and BamHl cloning site, RSV- 
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LTR promoter, G418 selectable marker; Invitrogen), and pEBVHis (RSV-LTR promoter, 
hygromycin selectable marker, N-terminal peptide purifiable via ProBond resin and 
cleaved by enterokinase; Invitrogen). Selectable mammalian expression vectors for use in 
the invention include pRc/CMV (Hindlll, BstXl, Notl, Sbal, and Apal cloning site, G418 
5 selection; Invitrogen), pRc/RSV (Hindlll, Spel, BstXl, Notl, Xbal cloning site, G41 8 
selection; Invitrogen), and others. Vaccinia virus mammalian expression vectors {see, 
Kaufman, 1991, supra) for use according to the invention include but are not limited to 
pSCl 1 (Smal cloning site, TK- and b-gal selection), pMJ601 {Sail, Smal, Afll, Narl, 
BspMll, BamUl, Apal, Nhel, Sacll, Kpnl, and Hindlll cloning site; TK- and b-gal 

1 0 selection), and pTKgptFl S (EcoRI, Pstl, Sail, Accl, Hindll, Sbal, BamHl, and Hpa cloning 
site, TK or XPRT selection). 

Yeast expression systems can also be used according to the invention to 
express Smurf 1 . For example, the non-fusion pYES2 vector (Xbal, Sphl, Shol, Notl, 
GstXl, EcoRl, BstXl, BamHl, Sad, Kpnl, and Hindlll cloning sit; Invitrogen) or the 

1 5 fusion p YESHis A, B, C {Xbal, Sphl, Shol, Notl, BstXl, EcoRI, BamH 1 , Sacl, Kpnl, and 
Hindlll cloning site, N-terminal peptide purified with ProBond resin and cleaved with 
enterokinase; Invitrogen), to mention just two, can be employed according to the 
invention. 

Once a particular recombinant Smurf DNA molecule is identified and 

20 isolated, several methods known in the art may be used to propagate it. Once a suitable 
host system and growth conditions are established, recombinant expression vectors can be 
propagated and prepared in quantity. As previously explained, the expression vectors 
which can be used include, but are not limited to, the following vectors or their 
derivatives: human or animal viruses such as vaccinia virus or adenovirus; insect viruses 

25 such as baculovirus; yeast vectors; bacteriophage vectors (e.g., lambda), and plasmid and 
cosmid DNA vectors, to name but a few. 

In addition, a host cell strain may be chosen which modulates the 
expression of the inserted sequences, or modifies and processes the gene product in the 
specific fashion desired. Different host cells have characteristic and specific mechanisms 

30 for the translational and post-translational processing and modification of proteins. 

Appropriate cell lines or host systems can be chosen to ensure the desired modification 
and processing of the foreign protein expressed. For example, expression in a bacterial 
system can be used to produce a nonglycosylated core protein product, while expression in 
yeast can produce a glycosylated product. Expression in eukaryotic cells can increase the 

35 likelihood of "native" folding of a heterologous mammalian protein. Moreover, 

expression in mammalian cells can provide a tool for reconstituting, or constituting, Smurf 
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activity. Furthermore, it is known in the art that different vector/host expression systems 
may affect processing reactions (e.g., proteolytic cleavages) to a different extent. 

Vectors are introduced into the desired host cells by methods known in the 
art, e.g., transfection, electroporation, microinjection, transduction, cell fusion, DEAE 
5 dextran, calcium phosphate precipitation, lipofection (lysosome fusion), use of a gene gun, 
or a DNA vector transporter (see, e.g., Wu et al, 1992, J. Biol. Chem. 267:963-967; Wu 
and Wu, 1988, J. Biol. Chem. 263:14621-14624; Hartmut et al, Canadian Patent 
Application No. 2,012,31 1, filed March 15, 1990). 

Analysis of Gene Expression Mediated by Smurf Functional Activity 

10 In one embodiment, oligonucleotide array technology can be used, e.g., to 

evaluate gene expression, after binding of Smurf 1 to Smadsl or 5, or binding of Smurf2 to 
Smad7, and to identify gene expression that correlates with, or is distinct from, gene 
expression in TGFp or BMP treated cells or cells of injured and/or healing tissue, tissue 
having cells in a tumorigenic state, and during cellular and developmental processes, such 

1 5 as mitosis, cell differentiation, embryonic pattern formation and development and 

organogenesis. For example, one could compare gene expression in TGFp or BMP treated 
cells in the presence or absence of Smurf expression. GeneChip expression analysis 
(Affymetrix, Santa Clara, C A) generates data for the assessment of gene expression 
profiles and other biological assays. Oligonucleotide expression arrays simultaneously and 

20 quantitatively interrogate thousands of mRNA transcripts (genes or ESTs), simplifying 
large genomic studies. Each transcript can be represented on a probe array by multiple 
probe pairs to differentiate among closely related members of gene families. Each probe 
cell contains millions of copies of a specific oligonucleotide probe, permitting the 
accurate and sensitive detection of low-intensity mRNA hybridization patterns. 

25 Differential expression data can provide a clear understanding of cellular pathways. 

After hybridization intensity data is captured, e.g., using a Hewlett-Packard 
GeneArray™ scanner. Software can be used to automatically calculate intensity values for 
each probe cell. Probe cell intensities can be used to calculate an average intensity for 
each gene, which directly correlates with mRNA abundance levels. Expression data can 

30 be quickly sorted on any analysis parameter and displayed in a variety of graphical formats 
for any selected subset of genes. Standard and custom GeneChip expression probe arrays 
are available today for human, mouse, yeast and other organisms. The GeneChip product 
line will expand to include expression arrays for the analysis of additional organisms and 
application areas such as toxicology and pharmacogenomics. 



Transgenic Vectors 

Smurf can be introduced into cells to treat a disorder associated with excess 
BMP or TGFp activation, such as cancer. Smurf activity can be evaluated by introducing a 
Smurf vector into a cell and monitoring the cell upon Smurf expression. This can be done 
5 in vitro or in vivo, or in vitro followed by transplantation in vivo, also termed ex vivo. 
Alternatively, as discussed above, Smurf or a Smurf inhibitor (antisense, ribozyme, or 
intracellular antibody) can be delivered by a vector in modulate Smads, e.g., to prevent 
Smurf regulation of Smads where BMP or TGFp activity is desired, such as in bone 
regeneration, or to study Smurf regulated processes in vivo. 

10 Smurf activity can be inhibited by various means, including by delivery of a 

vector encoding a dominant-negative Smurf derivative {e.g., a Cys710 to Ala mutant) to 
cells, by antisense nucleic acids (including ribozymes and triple-helix-forming 
oligonucleotides; these are described in detail supra), and by expression of anti-Smurf 
intracellular antibodies, e.g., single chain Fv antibodies (see generally Chen, Mol. Med. 

15 Today, 3:160-167, 1997; Spitz et al, Anticancer Res., 16:3415-3422, 1996; Indolfi et al, 
Nat. Med., 2:634-635, 1996; Kijima <?/ a/., Pharmacol. Ther., 68:247-267, 1995). 

As discussed above, a vector is any means for the transfer of a nucleic acid 
according to the invention into a host cell. These include viral vectors, such as 
lentiviruses, retroviruses, herpes viruses, adenoviruses and adeno-associated viruses. 

20 Thus, a gene encoding a functional or mutant Smurf protein or polypeptide domain 
fragment thereof can be introduced in vivo, ex vivo, or in vitro using a viral vector or 
through direct introduction of DNA. Expression in targeted tissues can be effected by 
targeting the transgenic vector to specific cells, such as with a viral vector or a receptor 
ligand, or by using a tissue-specific promoter, or both. Targeted gene delivery is described 

25 in International Patent Publication WO 95/28494, published October 1995. 

Viral vectors commonly used for in vivo or ex vivo targeting and therapy 
procedures are DNA-based vectors and retroviral vectors. Methods for constructing and 
using viral vectors are known in the art [see, e.g., Miller and Rosman, BioTechniques 
7:980-990 (1992)]. Preferably, the viral vectors are replication defective, that is, they are 

30 unable to replicate autonomously in the target cell. In general, the genome of the 

replication defective viral vectors which are used within the scope of the present invention 
lack at least one region which is necessary for the replication of the virus in the infected 
cell. These regions can either be eliminated (in whole or in part), be rendered non- 
functional by any technique known to a person skilled in the art. These techniques include 

35 the total removal, substitution (by other sequences, in particular by the inserted nucleic 
acid), partial deletion or addition of one or more bases to an essential (for replication) 
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region. Such techniques may be performed in vitro (on the isolated DNA) or in situ, using 
the techniques of genetic manipulation or by treatment with mutagenic agents. Preferably, 
the replication defective virus retains the sequences of its genome which are necessary for 
encapsidating the viral particles. 
5 DNA viral vectors include an attenuated or defective DNA virus, such as 

but not limited to herpes simplex virus (HSV), papillomavirus, Epstein Barr virus (EBV), 
adenovirus, adeno-associated virus (AAV), and the like. Defective viruses, which entirely 
or almost entirely lack viral genes, are preferred. Defective virus is not infective after 
introduction into a cell. Use of defective viral vectors allows for administration to cells in 

10 a specific, localized area, without concern that the vector can infect other cells. Thus, a 
specific tissue can be specifically targeted. Examples of particular vectors include, but are 
not limited to, a defective herpes virus 1 (HSV1) vector [Kaplitt et al, Molec. Cell. 
NeuroscL, 2:320-330 (1991)], defective herpes virus vector lacking a glyco-protein L gene 
[Patent Publication RD 371005 A], or other defective herpes virus vectors [International 

15 Patent Publication No. WO 94/21807, published September 29, 1994; International Patent 
Publication No. WO 92/05263, published April 2, 1994]; an attenuated adenovirus vector, 
such as the vector described by Stratford-Perricaudet et al [J. Clin. Invest., 90:626-630, 
1992; see also La Salle et al, Science, 259:988-990, 1993]; and a defective adeno- 
associated virus vector [Samulski et al, J. Virol., 61:3096-3101, 1987; Samulski et al, J. 

20 Virol., 63:3822-3828,1989; Lebkowski et al, Mol. Cell. Biol., 8:3988-3996, 1988)]. 

Preferably, for in vivo administration, an appropriate immunosuppressive 
treatment is employed in conjunction with the viral vector, e.g., adenovirus vector, to 
avoid immuno-deactivation of the viral vector and transfected cells. For example, 
immunosuppressive cytokines, such as interleukin-12 (IL-12), interferon-g (IFN-g), or 

25 anti-CD4 antibody, can be administered to block humoral or cellular immune responses to 
the viral vectors [see, e.g., Wilson, Nature Medicine (1995)]. In addition, it is 
advantageous to employ a viral vector that is engineered to express a minimal number of 
antigens. 

Adenovirus vectors. Adenoviruses are eukaryotic DNA viruses that can be 
30 modified to efficiently deliver a nucleic acid of the invention to a variety of cell types. 

Various serotypes of adenovirus exist. Of these serotypes, preference is given, within the 
scope of the present invention, to using type 2 or type 5 human adenoviruses (Ad 2 or Ad 
5) or adenoviruses of animal origin (see W094/26914). Those adenoviruses of animal 
origin which can be used within the scope of the present invention include adenoviruses of 
35 canine, bovine, murine (example: Mavl, Beard et al, Virology 75 (1990) 81), ovine, 

porcine, avian, and simian (example: SAV) origin. Preferably, the adenovirus of animal 
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origin is a canine adenovirus, more preferably a CAV2 adenovirus (e.g. Manhattan or 
A26/61 strain (ATCC VR-800), for example). 

Preferably, the replication defective adenoviral vectors of the invention 
comprise the ITRs, an encapsidation sequence and the nucleic acid of interest. Still more 
5 preferably, at least the El region of the adenoviral vector is non-functional. The deletion in 
the El region preferably extends from nucleotides 455 to 3329 in the sequence of the Ad5 
adenovirus (PvuII-Bglll fragment) or 382 to 3446 (HinfII-Sau3A fragment). Other regions 
may also be modified, in particular the E3 region (WO95/02697), the E2 region 
(W094/28938), the E4 region (W094/28152, W094/12649 and WO95/02697), or in any 

10 of the late genes L1-L5. 

In a specific embodiment, the adenoviral vector has a deletion in the El 
region (Ad 1.0). Examples of El -deleted adenoviruses are disclosed in EP 185,573, the 
contents of which are incorporated herein by reference. In another embodiment, the 
adenoviral vector has a deletion in the El and E4 regions (Ad 3.0). Examples of E1/E4- 

1 5 deleted adenoviruses are disclosed in WO95/02697 and W096/22378, the contents of 
which are incorporated herein by reference. In still another preferred embodiment, the 
adenoviral vector has a deletion in the El region into which the E4 region and the nucleic 
acid sequence are inserted (see FR94 13355, the contents of which are incorporated herein 
by reference). 

20 The replication defective recombinant adenoviruses according to the 

invention can be prepared by any technique known to the person skilled in the art (Levrero 
etaL, Gene 101:195 1991; EP 185 573; Graham, EMBO J. 3:2917, 1984). In particular, 
they can be prepared by homologous recombination between an adenovirus and a plasmid 
which carries, inter alia, the DNA sequence of interest. The homologous recombination is 

25 effected following cotransfection of the said adenovirus and plasmid into an appropriate 
cell line. The cell line which is employed should preferably (i) be transformable by the 
said elements, and (ii) contain the sequences which are able to complement the part of the 
genome of the replication defective adenovirus, preferably in integrated form in order to 
avoid the risks of recombination. Examples of cell lines which may be used are the human 

30 embryonic kidney cell line 293 (Graham et aL, J. Gen. Virol. 36:59 1977) which contains 
the left-hand portion of the genome of an Ad5 adenovirus (12%) integrated into its 
genome, and cell lines which are able to complement the El and E4 functions, as 
described in applications W094/26914 and WO95/02697. Recombinant adenoviruses are 
recovered and purified using standard molecular biological techniques, which are well 

35 known to one of ordinary skill in the art. 
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Adeno-associated viruses. The adeno-associated viruses (AAV) are DNA 
viruses of relatively small size which can integrate, in a stable and site-specific manner, 
into the genome of the cells which they infect. They are able to infect a wide spectrum of 
cells without inducing any effects on cellular growth, morphology or differentiation, and 
5 they do not appear to be involved in human pathologies. The AAV genome has been 
cloned, sequenced and characterized. It encompasses approximately 4700 bases and 
contains an inverted terminal repeat (ITR) region of approximately 145 bases at each end, 
which serves as an origin of replication for the virus. The remainder of the genome is 
divided into two essential regions which carry the encapsidation functions: the left-hand 

10 part of the genome, which contains the rep gene involved in viral replication and 

expression of the viral genes; and the right-hand part of the genome, which contains the 
cap gene encoding the capsid proteins of the virus. 

The use of vectors derived from the AAVs for transferring genes in vitro 
and in vivo has been described (see WO 91/18088; WO 93/09239; US 4,797,368, US 

15 5,139,941, EP 488 528). These publications describe various AAV-derived constructs in 
which the rep and/or cap genes are deleted and replaced by a gene of interest, and the use 
of these constructs for transferring the said gene of interest in vitro (into cultured cells) or 
in vivo, (directly into an organism). The replication defective recombinant AAVs 
according to the invention can be prepared by cotransfecting a plasmid containing the 

20 nucleic acid sequence of interest flanked by two AAV inverted terminal repeat (ITR) 

regions, and a plasmid carrying the AAV encapsidation genes (rep and cap genes), into a 
cell line which is infected with a human helper virus (for example an adenovirus). The 
AAV recombinants which are produced are then purified by standard techniques. 

Retrovirus vectors. In another embodiment the gene can be introduced in a 

25 retroviral vector, e.g., as described in Anderson et aL, U.S. Patent No. 5,399,346; Mann et 
aL, 1983, Cell 33:153; Temin et aL, U.S. Patent No. 4,650,764; Temin et aL, U.S. Patent 
No. 4,980,289; Markowitz et aL, 1988, J. Virol. 62:1 120; Temin et aL, U.S. Patent No. 
5,124,263; EP 453242, EP178220; Bernstein et al. Genet. Eng. 7 (1985) 235; McCormick, 
BioTechnology 3 (1985) 689; International Patent Publication No. WO 95/07358, 

30 published March 16, 1995, by Dougherty et aL; and Kuo et aL, 1993, Blood 82:845. The 
retroviruses are integrating viruses which infect dividing cells. The retrovirus genome 
includes two LTRs, an encapsidation sequence and three coding regions (gag, pol and 
env). In recombinant retroviral vectors, the gag, pol and env genes are generally deleted, 
in whole or in part, and replaced with a heterologous nucleic acid sequence of interest. 

35 These vectors can be constructed from different types of retrovirus, such as, HIV, 

MoMuLV ("murine Moloney leukaemia virus 11 MSV ("murine Moloney sarcoma virus"), 
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HaSV ("Harvey sarcoma virus"); SNV ("spleen necrosis virus"); RSV ("Rous sarcoma 
virus") and Friend virus. Defective retroviral vectors are disclosed in WO95/02697. 

In general, in order to construct recombinant retroviruses containing a 
nucleic acid sequence, a plasmid is constructed which contains the LTRs, the 
5 encapsidation sequence and the coding sequence. This construct is used to transfect a 
packaging cell line, which cell line is able to supply in trans the retroviral functions which 
are deficient in the plasmid. In general, the packaging cell lines are thus able to express 
the gag, pol and env genes. Such packaging cell lines have been described in the prior art, 
in particular the cell line PA317 (US4,861,719); the PsiCRIP cell line (WO90/02806) and 

10 the GP+envAm-12 cell line (WO89/07150). In addition, the recombinant retroviral 

vectors can contain modifications within the LTRs for suppressing transcriptional activity 
as well as extensive encapsidation sequences which may include a part of the gag gene 
(Bender et al 9 J. Virol. 61 (1987) 1639). Recombinant retroviral vectors are purified by 
standard techniques known to those having ordinary skill in the art. 

1 5 Retroviral vectors can be constructed to function as infections particles or 

to undergo a single round of transfection. In the former case, the virus is modified to 
retain all of its genes except for those responsible for oncogenic transformation properties, 
and to express the heterologous gene. Non-infectious viral vectors are prepared to destroy 
the viral packaging signal, but retain the structural genes required to package the co- 

20 introduced virus engineered to contain the heterologous gene and the packaging signals. 
Thus, the viral particles that are produced are not capable of producing additional virus. 

Lentivirus vectors. In another embodiment, lentiviral vectors are can be 
used as agents for the direct delivery and sustained expression of a transgene in several 
tissue types, including brain, retina, muscle, liver and blood. The vectors can efficiently 

25 transduce dividing and nondividing cells in these tissues, and maintain long-term 
expression of the gene of interest. 

Lentiviruses contain at least two regulatory genes, tat and rev, that are 
essential for replication, and four accessory genes that encode critical virulence factors. 
[For a review, see Naldini, L., Curr. Opin. Biotechnol., 9:457-63, 1998]. The viral 

30 sequences non-essential for transduction are eliminated thereby improving the biosafety of 
this particular vector. Self-inactivating HIV-1 vectors are known, which have a deletion in 
the 3' long terminal repeat (LTR) including the TATA box, and significantly improve the 
biosafety of HIV-derived vectors by reducing the likelihood that replication-competent 
retroviruses will originate in the vector producer and target cells (Zufferey, et al, J. Virol., 

35 72:9873-80, 1998). In addition, the deletion improves the potential performance of the 

vector by removing LTR sequences previously associated with transcriptional interference 
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and suppression in vivo and by allowing the construction of more-stringent tissue-specific 
or regulatable vectors. 

Lentiviral packaging cell lines are available and known generally in the art. 
They facilitate the production of high-titer lentivirus vectors for gene therapy. An example 
5 is a tetracycline-inducible VSV-G pseudotyped lentivirus packaging cell line which can 
generate virus particles at titers greater than 10(6) IU/ml for at least 3 to 4 days (Kafri, et 
al, J. Virol., 73: 576-584, 1999). The vector produced by the inducible cell line can be 
concentrated as needed for efficiently transducing nondividing cells in vitro and in vivo. 

Non-viral vectors. Alternatively, the vector can be introduced in vivo by 

10 lipofection. For the past decade, there has been increasing use of liposomes for 

encapsulation and transfection of nucleic acids in vitro. Synthetic cationic lipids designed 
to limit the difficulties and dangers encountered with liposome mediated transfection can 
be used to prepare liposomes for in vivo transfection of a gene encoding a marker [Feigner, 
et. al., Proc. Natl. Acad. Sci. U.S.A., 84:7413-7417, 1987; see Mackey, et al, Proc. Natl. 

15 Acad. Sci. U.S.A., 85:8027-8031, 1988; Ulmere/a/., Science, 259:1745-1748, 1993]. 

The use of cationic lipids may promote encapsulation of negatively charged nucleic acids, 
and also promote fusion with negatively charged cell membranes [Feigner and Ringold, 
Science, 337:387-388, 1989]. Particularly useful lipid compounds and compositions for 
transfer of nucleic acids are described in International Patent Publications W095/ 18863 

20 and W096/17823, and in U.S. Patent No. 5,459,127. The use of lipofection to introduce 
exogenous genes into the specific organs in vivo has certain practical advantages. 
Molecular targeting of liposomes to specific cells represents one area of benefit. It is clear 
that directing transfection to particular cell types would be particularly advantageous in a 
tissue with cellular heterogeneity, such as pancreas, liver, kidney, and the brain. Lipids 

25 may be chemically coupled to other molecules for the purpose of targeting [see Mackey, 
et. ah. supra]. Targeted peptides, e.g., hormones or neurotransmitters, and proteins such 
as antibodies, or non-peptide molecules could be coupled to liposomes chemically. 

Other molecules are also useful for facilitating transfection of a nucleic acid 
in vivo, such as a cationic oligopeptide (e.g., International Patent Publication 

30 W095/21931), peptides derived from DNA binding proteins (e.g., International Patent 
Publication WO96/25508), or a cationic polymer (e.g., International Patent Publication 
W095/21931). 

It is also possible to introduce the vector in vivo as a naked DNA plasmid. 
Naked DNA vectors for gene therapy can be introduced into the desired host cells by 
35 methods known in the art, e.g., transfection, electroporation, microinjection, transduction, 
cell fusion, DEAE dextran, calcium phosphate precipitation, use of a gene gun, or use of a 
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DNA vector transporter [see, e.g., Wu <?/ at, J. Biol. Chem., 267:963-967, 1992; Wu and 
Wu, J. Biol. Chem., 263:14621-14624, 1988; Hartmut et al, Canadian Patent Application 
No. 2,012,311, filed March 15, 1990; Williams et al, Proc. Natl. Acad. Sci. USA, 
88:2726-2730, 1991]. Receptor-mediated DNA delivery approaches can also be used 
5 [Curiel et al, Hum. Gene Ther., 3:147-154, 1992; Wu and Wu, J. Biol. Chem., 262:4429- 
4432, 1987]. US Patent No. 5,580,859 discloses delivery of exogenous DNA sequences, 
free of transfection facilitating agents, in a mammal. 

Antibodies against Smurf Proteins 
According to the invention, a Smurf polypeptide produced recombinantly or 

10 by chemical synthesis, and fragments, variants, or other derivatives or analogs thereof, 
including fusion proteins, may be used as an immunogen to generate antibodies that 
recognize a Smurf polypeptide. Such antibodies include but are not limited to polyclonal, 
monoclonal, chimeric, single chain, Fab fragments, and a Fab expression library. Such an 
antibody is specific for Smurf proteins; it may recognize a mutant form of a Smurf, or 

1 5 wild-type Smurf. These antibodies can be used to alter the BMP pathway by inhibiting a 
Smurf protein (e.g., anti-Smurf intracellular antibodies) or for diagnostic purposes. 

Various procedures known in the art may be used for the production of 
polyclonal antibodies to a Smurf polypeptide or derivative or analog thereof. For the 
production of antibody, various host animals can be immunized by injection with a Smurf 

20 polypeptide, or a derivative (e.g., fragment or fusion protein) thereof, including but not 
limited to rabbits, mice, rats, sheep, goats, etc. In one embodiment, a Smurf polypeptide 
or fragment thereof can be conjugated to an immunogenic carrier, e.g., bovine serum 
albumin (BSA) or keyhole limpet hemocyanin (KLH). Various adjuvants may be used to 
increase the immunological response, depending on the host species, including but not 

25 limited to Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, 
surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially useful human 
adjuvants such as BCG (Bacille Calmette-Guerin) and Corynebacterium parvum. 

For preparation of monoclonal antibodies directed toward a Smurf 

30 polypeptide, or fragment, analog, or derivative thereof, any technique that provides for the 
production of antibody molecules by continuous cell lines in culture may be used. These 
include but are not limited to the hybridoma technique originally developed by Kohler and 
Milstein (Nature 256:495-497, 1975), as well as the trioma technique, the human B-cell 
hybridoma technique (Kozbor et aL 9 Immunology Today 4:72, 1983; Cote et al, Proc. 

35 Natl. Acad. Sci. U.S.A. 80:2026-2030, 1983), and the EBV-hybridoma technique to 

produce human monoclonal antibodies (Cole et al ? in Monoclonal Antibodies and Cancer 
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Therapy, Alan R. Liss, Inc., pp. 77-96, 1985). In an additional embodiment of the 
invention, monoclonal antibodies can be produced in germ- free animals (International 
Patent Publication No. WO 89/12690, published 28 December 1989). In fact, according to 
the invention, techniques developed for the production of "chimeric antibodies" (Morrison 
5 et al, J. Bacteriol. 159:870, 1984; Neuberger et al, Nature 312:604-608, 1984; Takeda et 
al, Nature 314:452-454, 1985) by splicing the genes from a mouse antibody molecule 
specific for a Smurf polypeptide together with genes from a human antibody molecule of 
appropriate biological activity can be used; such antibodies are within the scope of this 
invention. Such human or humanized chimeric antibodies are preferred for use in therapy 
1 0 of human diseases or disorders (described infra), since the human or humanized antibodies 
are much less likely than xenogenic antibodies to induce an immune response, in particular 
an allergic response, themselves. 

In accordance with the invention, techniques described for the production 
of single chain antibodies (U.S. Patent Nos. 5,476,786 and 5,132,405 to Huston; U.S. 
1 5 Patent 4,946,778) can be adapted to produce a Smurf polypeptide-specific single chain 
antibodies. An additional embodiment of the invention utilizes the techniques described 
for the construction of Fab expression libraries (Huse et al, Science 246:1275-1281, 1989) 
to allow rapid and easy identification of monoclonal Fab fragments with the desired 
specificity for a Smurf polypeptide, or its derivatives, or analogs. 
20 Antibody fragments which contain the idiotype of the antibody molecule 

can be generated by known techniques. For example, such fragments include but are not 
limited to: the F(ab') 2 fragment which can be produced by pepsin digestion of the antibody 
molecule; the Fab' fragments which can be generated by reducing the disulfide bridges of 
the F(ab*) 2 fragment, and the Fab fragments which can be generated by treating the 
25 antibody molecule with papain and a reducing agent. Smurf activity can also be inhibited 
by expression of anti-Smurf intracellular antibodies, e.g., single chain Fv antibodies, using 
techniques known in the art (see generally Chen, Mol. Med. Today 3:160-167, 1997; Spitz, 
etal, Anticancer Res. 16:3415-3422, 1996; Indolfi etal^at Med. 2:634-635, 1996; 
Kijima et al, Pharmacol. Ther. 68:247-267, 1995). 
30 In the production of antibodies, screening for the desired antibody can be 

accomplished by techniques known in the art, e.g., radioimmunoassay, ELISA (enzyme- 
linked immunosorbant assay), "sandwich" immunoassays, immunoradiometric assays, gel 
diffusion precipitin reactions, immunodiffusion assays, in situ immunoassays (using 
colloidal gold, enzyme or radioisotope labels, for example), western blots, precipitation 
35 reactions, agglutination assays (e.g. , gel agglutination assays, hemagglutination assays), 
complement fixation assays, immunofluorescence assays, protein A assays, and 
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immunoelectrophoresis assays, etc. In one embodiment, antibody binding is detected by 
detecting a label on the primary antibody. In another embodiment, the primary antibody is 
detected by detecting binding of a secondary antibody or reagent to the primary antibody. 
In a further embodiment, the secondary antibody is labeled. Many means are known in the 
5 art for detecting binding in an immunoassay and are within the scope of the present 

invention. For example, to select antibodies which recognize a specific epitope of a Smurf 
polypeptide, one may assay generated hybridomas for a product which binds to a Smurf 
polypeptide fragment containing such epitope. For selection of an antibody specific to a 
Smurf polypeptide from a particular species of animal, one can select on the basis of 
10 positive binding with a Smurf polypeptide expressed by or isolated from cells of that 
species of animal. 

The foregoing antibodies can be used in methods known in the art relating 
to the localization and activity of a Smurf polypeptide, e.g., for Western blotting, imaging 
a Smurf polypeptide in situ, measuring levels thereof in appropriate physiological samples, 
15 etc., using any of the detection techniques mentioned above or known in the art. Such 
antibodies can also be used in assays for ligand binding, e.g., as described in US Patent 
No. 5,679,582. 

In a specific embodiment, antibodies that agonize or antagonize the activity 
of a Smurf polypeptide can be generated. Such antibodies can be tested using the assays 
20 described infra for identifying ligands. 

Screening 

According to the present invention, nucleotide sequences derived from the 
gene encoding a Smurf, and peptide sequences derived therefrom, are useful targets to 
identify drugs that are effective in treating disorders mediated by the BMP signaling 

25 pathway, as discussed above (see, Schmitt et aL 9 J. Orthopedic Res., 17:269, 1999) can be 
used to agonize or antagonize a Smurf protein. Drug targets include without limitation (i) 
isolated nucleic acids derived from the gene encoding a Smurf and (ii) isolated peptides 
and polypeptides derived from Smurf polypeptides. 

In particular, identification and isolation of a Smurf protein provides for 

30 development of screening assays, particularly for high throughput screening of molecules 
that up- or down-regulate the activity of a Smurf, e.g., by permitting expression of a Smurf 
protein in quantities greater than can be isolated from natural sources, or in indicator cells 
that are specially engineered to indicate the activity of a Smurf protein expressed from a 
gene after transfection or transformation of the cells. Accordingly, the present invention 

35 contemplates methods for identifying specific ligands of a Smurf protein using various 
screening assays known in the art. 



42 

Any screening technique known in the art can be used to screen for agonists 
or antagonists of a Smurf protein. The present invention contemplates screens for small 
molecule ligands or ligand analogs and mimics, as well as screens for natural ligands that 
bind to and agonize or antagonize a Smurf protein in vivo. For example, natural products 
5 libraries can be screened using assays of the invention for molecules that agonize or 
antagonize Smurf activity- 
Knowledge of the primary sequence of Smurf 1 and Smurf2, and the 
similarity of that sequence with proteins of known function, provides one having ordinary 
skill in the art with useful information to determine inhibitors or antagonists of a Smurf 

10 protein. Identification and screening of antagonists is further facilitated by determining 
structural features of the protein, e.g., using X-ray crystallography, neutron diffraction, 
nuclear magnetic resonance spectrometry, and other techniques for structure 
determination. These techniques provide for the rational design or identification of 
agonists and antagonists. 

15 Another approach uses recombinant bacteriophage to produce large 

libraries. Using the "phage method" (Scott and Smith, Science 249:386-390, 1990; 
Cwirla, et al, Proc. Natl. Acad. Sci., 87:6378-6382, 1990; Devlin et al, Science, 49:404-' 
406, 1990), very large libraries can be constructed (10 6 -10 8 chemical entities). A second 
approach uses primarily chemical methods, of which the Geysen method (Geysen et al, 

20 Molecular Immunology 23:709-715, 1986; Geysen et al J. Immunologic Method 102:259- 
274, 1987; and the method of Fodor et al (Science 251:767-773, 1991) are examples. 
Furka et al (14th International Congress of Biochemistry, Volume #5, Abstract FR:013, 
1988; Furka, Int. J. Peptide Protein Res. 37:487-493, 1991), Houghton (U.S. Patent No. 
4,631,211, issued December 1986) and Rutter et al (U.S. Patent No. 5,010,175, issued 

25 April 23, 1991) describe methods to produce a mixture of peptides that can be tested as 
agonists or antagonists. 

In another aspect, synthetic libraries (Needels et al, Proc. Natl. Acad. Sci. 
USA 90:10700-4, 1993; Ohlmeyer et al, Proc. Natl. Acad. Sci. USA 90:10922-10926, 
1993; Lam et al, International Patent Publication No. WO 92/00252; Kocis et aL, 

30 International Patent Publication No. WO 9428028) and the like can be used to screen for 
Smurf ligands according to the present invention. 

Test compounds are screened from large libraries of synthetic or natural 
compounds. Numerous means are currently used for random and directed synthesis of 
saccharide, peptide, and nucleic acid based compounds. Synthetic compound libraries are 

35 commercially available from Maybridge Chemical Co. (Trevillet, Cornwall, UK), 

Comgenex (Princeton, NJ), Brandon Associates (Merrimack, NH), and Microsource (New 
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Milford, CT). A rare chemical library is available from Aldrich (Milwaukee, WI). 
Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant and 
animal extracts are available from e.g. Pan Laboratories (Bothell, WA) or MycoSearch 
(NC), or are readily producible. Additionally, natural and synthetically produced libraries 
5 and compounds are readily modified through conventional chemical, physical, and 
biochemical means (Blondelle et al., Tib Tech, 14:60, 1996). 

In vitro screening methods 
In one series of embodiments, an isolated nucleic acid comprising a Smurf 
gene is tested in vitro for its ability to bind test compounds in a sequence-specific manner. 
10 The methods comprise: 

(i) providing a nucleic acid containing a particular sequence corresponding 
to all or portions of a Smurf protein; 

(ii) contacting the nucleic acid with a multiplicity of test compounds under 
conditions appropriate for binding; and 

15 (iii) identifying those compounds that bind selectively to the nucleic acid 

sequence. 

Selective binding as used herein refers to any measurable difference in any 
parameter of binding, such as, e.g., binding affinity, binding capacity, etc. 

In one series of embodiments, an isolated peptide or polypeptide, or 
20 fragments thereof, comprising a Smurf protein is tested in vitro for its ability to bind test 
compounds in a sequence-specific manner. The screening methods involve: 

(i) providing a peptide or polypeptide, or fragment thereof, corresponding 
to Smurf protein or a fragment thereof; 

(ii) contacting the peptides with a multiplicity of test compounds under 
25 conditions appropriate for binding; and 

(iii) identifying those compounds that bind selectively to the peptides. 

In preferred embodiments, high-throughput screening protocols are used to 
survey a large number of test compounds for their ability to bind the genes or peptides 
disclosed above in a sequence-specific manner. 
30 In vivo screening methods 

Intact cells or whole animals expressing variants of a gene encoding a 
Smurf protein can be used in screening methods to identify candidate drugs. The 
following methods can be applied to normal or wild-type Smurf. 

In one series of embodiments, a permanent cell line is established from an 
35 individual exhibiting expression of a variant Smurf gene. Alternatively, cells (including 
without limitation mammalian, mammalian, insect, yeast, and bacterial cells) are 
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programmed to express a gene comprising one or more variant Smurf sequences by 
introduction of suitable vector. Identification of candidate compounds can be achieved 
using any suitable assay, including without limitation (i) assays that measure selective 
binding of test compounds to particular variant of a Smurf; (ii) assays that measure the 
5 ability of a test compound to modify (i.e., inhibit or enhance) a measurable activity or 
function of the Smurf; and (iii) assays that measure the ability of a compound to modify 
(i.e., inhibit or enhance) the transcriptional activity of sequences derived from the 
promoter (i.e., regulatory) regions of a Smurf gene. 

In another series of embodiments, transgenic animals are created in which 

10 (i) a human Smurf, having one or more mutations is stably inserted into the genome of the 
transgenic animal; and/or (ii) endogenous Smurf genes are inactivated and replaced with a 
variant human Smurf gene. See, e.g., Coffman, Semin. Nephrol. 17:404, 1997; Esther et 
al, Lab. Invest. 74:953, 1996; Murakami etal, Blood Press. Suppl. 2:36, 1996. A 
preferred method for creating such a transgenic animal is so called "knock-in" technology, 

1 5 where a human gene can be inserted to replace an endogenous gene under expression 

control of the endogenous genes' regulatory elements (see, Rodriguez et al, Cell, 97:199, 
1999). Such animals can be treated with candidate compounds and monitored for any 
alteration in the BMP and TGFp/activin pathways. 

Furthermore, populations that are not amenable to an established treatment 

20 for tissue and bone degeneration (e.g., osteoporosis) or enhancing regeneration can be 

selected for testing of alternative treatments. Moreover, treatments that are not as effective 
in the general population, but that are highly effective in the selected population, may be 
identified that otherwise would be overlooked. This is an especially powerful advantage 
of the present invention, since it eliminates some of the randomness associated with 

25 clinical trials. 

High-Throughput Screen 
Agents according to the invention may be identified by screening in high- 
throughput assays, including without limitation cell-based or cell-free assays. It will be 
appreciated by those skilled in the art that different types of assays can be used to detect 
30 different types of agents. Several methods of automated assays have been developed in 
recent years so as to permit screening of tens of thousands of compounds in a short period 
of time. Such high-throughput screening methods are particularly preferred. The use of 
high-throughput screening assays to test for agents is greatly facilitated by the availability 
of large amounts of purified polypeptides, as provided by the invention. 
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Reporter Genes 

Smad proteins have been identified as mediators of intracellular signal 
transduction by members of the transforming growth factor-beta (TGF-beta) superfamily, 
which affect cell proliferation, differentiation, as well as pattern formation during early 
5 vertebrate development. Following receptor activation, Smads are assembled into 

heteromeric complexes consisting of a pathway-restricted Smad and the common Smad4 
that are subsequently translocated into the nucleus where they are thought to play an 
important role in gene transcription. Smad Binding Elements (SBEs) containing the 
sequence CAGACA in the promoter of the JunB gene, is an immediate early gene that is 

10 potently induced by TGF-beta, activin, and bone morphogenetic protein (BMP) 2 (Jonk 
et al, J. Biol. Chem., 273:21 145, 1998). Two JunB SBEs are arranged as an inverted 
repeat, which is transactivated in response to Smad3 and Smad4 co-overexpression and 
shows inducible binding of a Smad3- and Smad4-containing complex in nuclear extracts 
from TGF-beta-treated cells. Smad proteins bind directly to the SBE. Multimerization of 

1 5 the SBE creates a powerful TGF-beta-inducible enhancer that is also responsive to activin 
and BMPs (see, e.g., Kim et al., Nature, 388:304, 1997). 

Reporter gene expression can be tied to expression or activation of any 
component of the BMP signaling pathway. Any reporter gene known in the art can be 
used. Thus, nucleotide sequences having the SBE sequence can be isolated using 

20 techniques known in the art and operably linked to a reporter gene to determine whether a 
Smurf, derivatives, and variants thereof, alter the activity of such promoter. In addition, 
other known genes in the BMP signaling pathway, such as T 1x2 or Smad7, which have 
promoters that are regulated by Smadl and 5 can be operably linked to a reporter gene. In 
these assays, candidate compounds can be tested for their ability to alter BMP of TGFp 

25 signaling in cells that express Smadl, Smad5, Smad7, Smurf 1 or Smurf2, or derivatives or 
variants thereof. 

Various reporter gene assays are known and can be used to evaluate the 
effects of a Smurf on a BMP or TGFp/activin signaling pathway. Reporter genes include 
luciferase, p-galactosidase (p-gal or lac-Z), chloramphenicol acetyltransferase (CAT), 

30 horseradish peroxidase, and alkaline phosphatase. In addition, expression of almost any 
protein can be detected using a specific antibody. For example, a green fluorescent protein 
(GFP) expression assay permits evaluation of Smad-induced SBE activity. GFP has been 
modified to produce proteins that remain functional but have different fluorescent 
properties. Several U.S. patents teach use of GFP to visualize signaling pathways linked 

35 to the reporter gene by an inducible promoter. For example, see U.S. Patent No. 5,625,048 
(modified GFP resulting in amino-acid changes provides visibly distinct colors and 
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increased intensities of emission), W096/23898 (construct encoding a modified GFP 
which also contains an enzyme recognition site), W097/1 1094 (fluorescent proteins with 
increased intensity), W097/266333 (humanized GFP protein optimized to provide higher 
levels of expression in mammalian cells), WO97/42320 (modified GFP having increased 
5 intensity of fluorescence, WO98/06737 (modified GFP easily distinguished from green 
and blue fluorescent proteins), W098/21355 (GFP mutants excitable using blue and white 
light). 

Screening Kits 

The components required to practice the screening methods described 

1 0 above 

can be prepared in kit form, for the convenience of the user. Such kits are preferably 
adapted for use in an automated screening apparatus. 

Identification of Smurfl Binding Partners 

In still another embodiment, the present invention provides for 
15 identification of Smurf binding partners, in addition to Smadl, Smad7 and Smad5, which 
can then be analyzed for mutations that lead to diseases mediated by the BMP or 
TGFp/activin pathways. One method for identifying such binding partners is a yeast two 
hybrid assay system, preferably using a hematopoietic stem cell library with yeast that are 
transformed with a recombinant Smurf Alternatively, a Smurf protein can be used to 
20 affinity purify proteins from cell preparations, e.g., using cells that endogenously produce 
Smurfs. Partially purified preparations can be probed with a labeled Smurf protein to 
identify specific binding partners, e.g., in a Western-type or other antibody-assay type of 
system (see the description above of antibodies for examples of such assays; naturally, any 
protein can be labelled as an antibody and its binding to a binding partner evaluated). 

25 EXAMPLES 

The present invention may be better understood by reference to the 
following examples, which are not intended to limit the invention. 

EXAMPLE 1 

Smurfl targets the BMP pathway and affects embryonic pattern 
30 formation. The TGFp superfamily regulates diverse biological processes including cell 

growth, differentiation and pattern formation. Any misregulation of TGFP signals may 

cause disease. Signals from activated TGFp receptors are directly transduced from 

receptor to nucleus by Smad proteins. Currently, a few links have been established 



W« V^r '"£L" . ib 5 ;>/' 3 'C' 



47 

between ubiquitin-mediated degradation and the modulation of morphogenetic signaling 
during development. This Example describes a new E3 ubiquitin ligase, Smurfl, that 
selectively interacts with BMP pathway-specific Smads to trigger their ubiquitination, 
degradation and loss of activity. In amphibian embryos, Smurfl specifically blocked BMP 
5 signals and affected pattern formation. Thus, targeted ubiquitination of Smads may 

function to control both embryonic development and a wide variety of cellular responses 
to TGFp signals. 

Methods 

Yeast two-hybrid screen. A Xenopus Smadl cDNA (36) was cloned into 

10 the pGBT9 vector and used to screen a Xenopus oocyte cDNA library (Clonetech) by the 
yeast two-hybrid method (46) using Xenopus Smadl as the bait protein. A partial cDNA 
was isolated and used to screen a Xenopus Stage9 (blastula) cDNA library to obtain a full 
length Smurfl cDNA [SEQ ID NO: 1]. A human Smurfl cDNA encoding all but the first 
8 amino acids was identified in the EST database (AA292123), and was used to construct 

1 5 human Smurfl . The first 8 amino acids were reconstituted using the corresponding 

Xenopus Smurfl cDNA sequence. hSmurf cDNA was FLAG tagged at its N-terminus. 
The ubiquitin ligase mutant version of hSmurfl was generated by replacing cysteine 710 
with alanine using a PCR-based approach and the mutation was confirmed by sequencing. 

Co-immunoprecipitation. For immunoprecipitation assays Xenopus Smadl 

20 (36), mouse Smad4, and human Smad2 (47), were FLAG-tagged at their C-termini and 
translated in vitro (rabbit reticulocyte extracts; Promega) in the presence of 35 S-Met. The 
FLAG-tagged Smads were bound to anti-FLAG antibody-conjugated beads (Kodak), 
washed in co-IP buffer (lOmM Tris, pH 7.5, 90 mM NaCl, 1 mM EDTA, 1% Triton-X 
100, 10% glycerol, 1 mM phenylmethylsulfonylflouride) then incubated with 

25 35 S-Met-labelled Smurfl in the same buffer. After washing in co-IP buffer and elution in 
gel loading buffer, proteins were separated by SDS-PAGE and visualized by 
autoradiography. 

Embryo methods and RT-PCR. Embryo production, preparation and 
injection of synthetic mRNA into embryos, animal cap and ventral marginal zone assays, 
30 embryonic RNA isolation, embryo fixation and whole mount in situ hybridization, and 
developmental RT-PCR were performed as described (36, 48). Primers for Smurfl 
RT-PCR were 5'-GTCCTGTGACTGGAACCC-3' (sense) [SEQ ID NO: 5] and 
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5'-.GAGGACTGCTAGACAAT-3' (antisense) [SEQ ID NO: 6], whose 5 1 ends are 
respectively located at positions 482 and 726 in the Smurf 1 cDNA. 

mSmurfl. Northern blots were performed of mSmurfl expression in 
embryonic and adult mouse tissues. Equal amounts of PolyA+ mRNA from embryonic 
5 tissue 7, 1 1 and 15 days post coitum and tissue samples from testes, kidney, skeletal 
muscle, lung, spleen, brain and heart were analyzed. Cytoskeletal actin expression was 
assayed on the same blot and verified that mRNA loading was equal in all lanes. 
Whole-mount in situ hybridization on mouse embryos were also performed. 

Immiinoprecipitations and Immunoblotting. COS-1 and 293T cells were 

10 transiently transfected using lipofectAMINE (GibroBRL) and calcium phosphate 

precipitation methods, respectively (49). Immunoprecipitations and immunoblotting were 
performed as described previously (50) using anti-Flag M2 monoclonal antibody (Sigma), 
anti-Smadl/5 polyclonal antibody (50) or anti-WW2 Nedd4 (51) polyclonal antibody. 
Detection was achieved using the appropriate HRP conjugated goat anti-mouse or goat 

15 anti-rabbit secondary antibodies and enhanced chemiluminescence (Amersham). 

Pulse Chase Analysis. COS-1 cells were transfected as indicated above. 
Two days post-transfection the cells were labelled for 10 min. at 37° C with 50 \xC\ 
[ 35 S]-methionine/ml in methionine-free DMEM (Pulse). Cell layers were then washed two 
times and incubated in DMEM+10% FCS for the indicated time periods (Chase). At each 

20 time point of the chase, cell lysates prepared in TNTE lysis buffer (50 mM Tris/HCl, pH 
7.4, 150 mM NaCl, 0.5% Triton X-100 and ImM EDTA) containing protease and 
phosphatase inhibitors were subjected to immunoprecipitations using an anti-Smadl 
polyclonal antibody. Immune complexes were resolved by SDS-PAGE and visualized by 
autoradiography. A Phosphorimager (Molecular Dynamics) was used to quantitate the 

25 amount of metabolically labelled Smadl present at each time point. 

Ubiquitination Assay. 293T cells were transfected with HA-tagged 
ubiquitin, untagged-Smadl, and either Flag-hSmurfl or Flag-hSmurfl (C710A) as 
indicated above. Two days post-transfection, cells were lysed and subjected to a-Smadl 
immunoprecipitation. The immunoprecipitates were then washed sequentially two times 

30 each in TNTE + 0.1% triton, SDS-RIPA (TNTE lysis buffer, 0.1% sodium dodecyl sulfate 
and 1% deoxycholate), and 500 mM LiCl, 50 mM Tris/HCl, pH 7.4 and 0.1% trition. The 
presence of HA-ubiquitinated Smadl in the immune complexes was visualized by 
SDS-PAGE followed by immunoblotting with the monoclonal anti-HA 12CA5. Protein 
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levels of untagged Smadl, Flag-hSmurfl and Flag-hSmurfl (C710A) were analyzed by 
immunoblotting aliquots of total cell lysates with the appropriate antibodies. 

Results 

Isolation of an E3 ubiquitin ligase that interacts with the BMP signal 
5 transducer Smadl. To isolate factors that interact with and potentially regulate Smadl we 
performed a yeast two hybrid screen using Xenopus Smadl as the bait protein. Several 
overlapping cDNAs were isolated that encoded a protein with significant homology to the 
Hect subclass of E3 ubiquitin ligases (22). This novel gene and a closely related human 
homolog, Smurfl and hSmurfl, respectively (for Smad ubiquitination regulatory factor- 1), 

10 are proteins of 731 amino acids in length, share 91% sequence identity and contain a Hect 
ubiquitin-ligase domain in the C-terminal portion of the molecule. This domain is 
characteristic of a specific class of E3 ubiquitin ligases (22) that include in mammals, 
E6-AP and Nedd4 and in yeast, RSP5p and Publ (23-26). 

Identification of human Smurfl genomic location. The human Smurfl 

1 5 genomic clone is located in PAC clone DJ0808A01 , from chromosomal region 
7q21.1-q31.1, located from nucleotides 2669.. 53763. A computer prediction of 
intron/exon splicing products generated a single conceptual mRNA with open reading 
frame directly corresponding to the actual cloned human cDNA. 

E3 ubiquitin ligases function together with El and E2 enzymes to conjugate 

20 ubiquitin to specific protein substrates, which targets the protein for subsequent 

degradation by the proteasome (12). El enzymes recruit and activate ubiquitin, which is 
then transferred to E2, a ubiquitin conjugating enzyme. The E2 in turn attaches ubiquitin 
either directly to the protein to be modified, or transfers ubiquitin to the E3 ubiquitin 
ligase, which confers substrate selectivity to the ubiquitination complex. By functioning to 

25 recruit specific target proteins to the conjugation machinery, E3 activity provides 

selectivity to the ubiquitination process. E3 activity can be provided by structurally and 
functionally diverse proteins. For instance, E3s may exist as multimeric protein 
complexes that facilitate substrate recognition, but may or may not possess direct ligase 
activity. Examples of this include the N-end rule E3s, and the Skpl/Cullin/F-box (SCF) 

30 complexes (27-29). In the case of the Hect family of E3 ubiquitin ligases, of which 
Smurfl is a member, a single protein provides both substrate specificity and catalytic 
activity (22, 30). 
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Smurf 1 also displays several other structural features characteristic of 
RSP5, Nedd4 and Publ ubiquitin ligases. This includes an amino-terminal phospholipid 
and calcium binding C2 domain, as well as two WW* domains, which facilitate 
protein-protein interactions by binding to PPXY motifs on partner proteins (31-33). 
5 Overall, Smurf 1 is most closely related to Publ (Fig. 1), a ubiquitin ligase from 

Sacromyces pombe that regulates mitosis by targeting cdc25 for proteosomal degradation 
(23). 

Expression of mSmurfl in embryonic and adult tissue. Northern blots of 
mSmurf 1 showed that a transcript of about 6 kb was the predominant form of Smurfl 

10 expressed in embryos (Fig. 2A). In tissues, major transcripts of 3.0 and 6.0 kb were 
observed (Fig. 2B). Cytoskeletal actin expression was assayed on the same blot and 
verified that mRNA loading was equal in all lanes. In the mouse, mSmurf 1 was present in 
very early development, from at least embryonic day 7 onward. In adults, mSmurfl was 
expressed in most tissues assayed, but there were differences in the abundance of the 

1 5 transcripts. We did not observe detectable mSmurfl transcripts in the skeletal muscle, and 
the kidney and spleen expressed low levels of the large transcript. The small transcript 
was very abundant in the testes, and it was very weakly or not at all expressed in the 
kidney, lung, spleen, brain and heart. Both blots were prepared by loading equal amounts 
of polyAH- RNA in each lane, but to account for any variation in RNA loading and transfer, 

20 we probed the blot with a tubulin probe, and found no substantial differences among lanes. 
Thus, the variation seen in the blots reflected the relative expression of mSmurfl in 
embryos and tissues. 

Whole-mount in situ hybridization on mouse embryos revealed Smurfl 
expression in the nervous system, eye, branchial arches, axial mesoderm (notochord), 

25 somites, and limb buds from days p.c. 10 - 14. 

Localization ofXenopus Smurfl to the egg animal pole and embryonic 
ectoderm. In Xenopus embryonic development, Smurfl mRNA was found expressed from 
egg through swimming tadpole stages, with maximum levels observed early in 
development at egg, blastula, and gastrula stages. Smurfl levels declined sharply near the 

30 end of gastrulation, yet zygotic expression persisted at a reduced level into late swimming 
tadpole stages (Fig. 3A). Whole mount in situ hybridization (Fig. 3B) revealed that 
maternal Smurfl transcripts were localized to the animal pole half of eggs and cleaving 
blastulae, which was confirmed by a northern blot analysis on RNA isolated from animal 
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and vegetal halves of middle blastula stage embryos (not shown). At gastrulation, Smurfl 
expression became more widespread in the embryo, coincidently with the decline in its 
transcript levels observed by RT-PCR. During neural plate closure, however, Smurfl 
expression was localized to the developing nervous system, and by tadpole stages it was 
5 highly expressed in the central nervous system, eye, pharyngeal pouches and somites. The 
embryonic expression pattern of Smurfl partially overlaps with the expression of Smadl 
and BMP-4 in the ectoderm at blastula and gastrula stages and the nervous system, eye, 
somites and pharyngeal pouches at tadpole stages (34-36). 

Expression of hSmurfl leads to selective reduction in the steady-state 

10 protein level of Smadl and Smad5 in mammalian cells. Smurfl contains a putative E3 
ligase or Hect domain, and therefore was a candidate protein for interacting with Smadl. 
To investigate whether Smurfl regulates steady-state levels of Smadl protein, two 
mammalian cell lines, 293T and COS-1, were transfected with Smadl alone or with 
increasing amounts of Flag-hSmurfl . The steady-state levels of Smadl protein were then 

15 evaluated by immunoblots of whole cell lysates. Smadl was readily detectable in the 
absence of hSmurfl (Fig. 4 A). However, expression of hSmurfl, even at low levels not 
readily detectable by Western blotting, produced a significant, dose-dependent decrease in 
Smadl protein levels. At the highest levels of hSmurfl expression there was no detectable 
Smadl protein. Furthermore, co-expression of a constitutively activated form of the BMP 

20 type I receptor ALK6, which phosphorylates Smadl (37, 38), did not alter 

hSmurfl -dependent decreases in Smadl levels (Fig. 4B). Thus, expression of hSmurfl 
causes dose-dependent decreases in steady- state levels of Smadl . This type of action can 
occur independently of activation of Smadl by the type I BMP receptor. 

To investigate whether Smurfl activity is exclusive to Smadl, we 

25 investigated its effects on Smad2, a receptor-regulated Smad that functions in TGF0 and 
activin signaling pathways. Unlike Smadl, which was sensitive to the lowest doses of 
hSmurfl , there was only a slight effect on steady-state levels of Smad2 protein, which 
occurred only at the highest levels of Flag-hSmurfl expression (Fig. 4C). Other Smads 
were tested: Flag-hSmurfl had little or no effect on Smad3 or Smad4 protein levels, but it 

30 elicited a strong decrease in Smad5 protein, which is closely related to Smadl (Fig. 4D). 
Together, these data demonstrate that hSmurfl preferentially regulates the steady-state 
levels of Smadl and Smad5, two receptor-regulated Smads that function in BMP 
signaling. 
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hSmurfl regulates Smadl degradation and ubiquitination. The inclusion 
of a putative E3 ligase or Hect domain supported the prediction that Smurfl functions as 
an E3 ubiquitin ligase. To investigate whether Smurfl regulates Smad degradation, 
studies focused on Smadl (91% homology with Smad5). Analysis of Smadl turnover by 
5 pulse-chase experiments revealed that in the absence of hSmurfl, Smadl had a half life of 
approximately 6 hours. However, in the presence of hSmurfl, Smadl turnover was 
significantly enhanced (half life of less than 2 hours) (Fig. 5 A). Thus, hSmurfl increases 
the rate of Smadl turnover. 

To determine the mechanism of hSmurfl -mediated turnover of Smadl, 

10 Smadl ubiquitination in intact cells was assessed. To facilitate detection of ubiquitin, 
293T cells were transfected with HA-tagged ubiquitin together with Smadl, in the 
presence or absence of Flag-hSmurf 1 . In the absence of hSmurfl, Smadl displayed little 
or no detectable ubiquitination. However, upon co-transfection with hSmurfl we 
observed the appearance of a ladder of ubiquitin-conjugated Smadl (Fig. 5B). To confirm 

15 that ubiquitination of Smadl required the catalytic activity of the Hect domain in hSmurfl, 
a point mutant in hSmurfl (hSmurfl (C71 OA)) was constructed. This residue is critical for 
the catalytic activity of the Hect domain and the mutation is thought to target the cysteine 
residue that forms a thiolester bond with ubiquitin (22). In contrast to wildtype hSmurfl, 
expression of hSmurfl (C710A) did not yield ubiquitinated Smadl. Moreover, hSmurfl 

20 (C710A) did not affect Smadl steady-state protein levels compared to wildtype hSmurfl, 
despite efficient expression of the mutant protein (Fig. 5C). Together, these data suggest 
that hSmurfl alters Smadl steady-state levels by inducing ubiquitin-mediated degradation 
of Smadl via its Hect domain. 

Smurfl interacts selectively in vivo with Smadl and Smad5. The 

25 interaction of Smurfl with Smad proteins was investigated to assess the basis for the 

selective targeting of Smadl and 5 for degradation by Smurfl . The yeast 2-hybrid assay 
was first used to test the ability of Xenopus Smurfl to interact with Smadl, Smad4 or the 
non-specific control protein nuclear lamin. Yeast co-transfected with Smurfl and Smadl 
exhibited significant p-galactosidase activity; yeast co-transfected with Smurfl and either 

30 Smad4 or lamin did not demonstrate P-galactosidase activity (Fig. 6 A, left). Smurfl 

selectively bound to Smadl, but not to Smad2 or Smad4 (Fig. 6A, right) as demonstrated 
by the capacity of 35 S-labelled Smurfl to bind and immunoprecipitate Smads in vitro. 



;i O O O s-f-«3 , 55 1 © S 



53 

To test the capacity of 35 S-labelled Smurfl to bind and immunoprecipitate 
Smads in vitro the specificity of Smurfl -Smad interactions in intact cells, the association 
of hSmurfl with various Smads in 293T cells was investigated. Wild type hSmurfl did 
not detect interactions with Smadl. However, it is possible that in intact cells 
5 Smurfl -Smadl interactions are transient in nature, since demonstrating association 
between ubiquitin ligases and their substrates has proven difficult in other systems. By 
examining Smadl interaction with the ubiquitin-ligase mutant Flag-hSmurfl (C710A), 
association of the proteins could be detected (Figs. 6B, C). Furthermore, this interaction 
was unaffected by coexpression of the constitutively active BMP-type I receptor, ALK2 

10 (data not shown), and is consistent with the notion that hSmurfl regulates Smadl turnover 
independent of BMP signaling. hSmurfl (C710A) also bound efficiently to Smad5, but 
analysis of association with Smad2 revealed little, if any, interaction (Figs. 6A and B). 

The linker regions of R-Smads contain a PPXY sequence, which is a 
conserved motif recognized by WW domains such as those found in Smurfl . Unlike wild 

1 5 type Smadl , a mutant of Smadl , in which the PY motif was deleted, associated only 

weakly with Smurfl and was resistant to Smurfl -mediated degradation (data not shown). 
These results show that hSmurfl associates specifically with Smadl and Smad5, which 
interaction is mediated by binding between the PY motif in Smadl and the WW domains 
of Smurfl . 

20 The specificity of hSmurfl as a ubiquitin ligase in targeting the Smads, was 

further tested by comparing it to Nedd4, a structurally related ubiquitin ligase. While 
Smadl co-precipitated efficiently with hSmurfl (C710A), it did not interact with the 
corresponding Nedd4 mutant. Consistent with this, overexpression of Nedd4 did not 
affect the steady-state level of Smadl protein (Fig. 6C, lower panel). Together, these data 

25 indicate that both human and Xenopus Smurfl proteins associate selectively with 

BMP-regulated Smadl and Smad5, and that this interaction is specific to Smurfl rather 
than a general feature of the Hect class of ubiquitin ligases. 

Smurfl antagonizes endogenous BMP signals in Xenopus embryos to 
dorsalize ventral mesoderm and neuralize ectoderm. In the Xenopus blastula, Smurfl 

30 expression is localized to the animal pole ectoderm and partly overlaps the marginal zone, 
i.e., a belt of cells located at the equator of the blastula which forms the mesoderm. This 
pattern of expression, considered together with the ability of Smurfl to interact with, 
ubiquitinate, and degrade BMP pathway-specific Smads, suggests that Smurfl may 



A O O O -$ : M! 5 - O *F, S.:I D S- 

54 

function in ectodermal and mesodermal patterning by antagonizing BMP signals through 
Smadl or SmadS. 

Presently, naturally-occurring inhibitors of TGFP activities have been 
described at the ligand leveL Those factors, which include chordin, follistatin and noggin, 
5 act outside of cells by binding particular ligands, including BMPs and activin. During 
Xenopus mesoderm induction and patterning, BMP signals in the ventral part of the 
marginal zone specify development of tissues, such as blood and mesenchyme, that are 
characteristic of the ventral region of the tadpole. However, if BMP signaling is blocked 
by ligand antagonists such as chordin, follistatin, or noggin (which are secreted by the 
10 dorsal Spemann organizer), or by artificial inhibitors such as dominant negative BMP 

ligands or receptors, the prospective ventral mesoderm will differentiate into dorsal tissues 
such as muscle and notochord (a process referred to as "dorsalization") (5, 39). 

Smurf 1 has the potential to regulate BMP signals in the marginal zone, so 
ectopic Smurfl expression was tested for interference with ventral tissue patterning. 
1 5 Smurfl mRNA was microinjected into 4 cell blastula stage embryos in the ventral 

marginal zone (VMZ) to trigger the formation of ectopic secondary axial structures in the 
ventral region of 52% of tadpoles (n = 25). These secondary axial structures induced by 
Smurfl were characteristic of dorsalization caused by BMP inhibition. Their formation 
was rescued by co-expression of Smadl together with Smurfl, demonstrating that the 
20 effect of Smurfl was limited to interference with the BMP/Smadl pathway (Fig. 7A). 

Furthermore, overexpression of Smurfl in cells of the dorsal marginal zone (DMZ), which 
forms head and dorsal axial structures, had no effect (data not shown). This latter 
observation was consistent with previous findings of the invention that Smurfl does not 
target Smad2 in cultured cells. 
25 The dorsalizing effects of Smurfl using VMZ explants was further 

characterized. In this case, Smurfl expression caused reduced blood differentiation and 
concomitant muscle differentiation, consistent with the dorsalizing effect of Smurfl (Fig. 
7 A, right panel). As in the axis formation assay, co-expression of Smadl with Smurfl 
reversed these dorsalizing effects and demonstrated BMP pathway specificity. 
30 In the ectodermal germ layer, endogenous BMP expression specifies 

epidermis, but when BMP signals are reduced or eliminated the ectoderm differentiates 
into cement gland or neural tissue, respectively (40, 41). The localization of Smurfl 
mRNA in the ectoderm of the Xenopus blastula and early gastrula suggested that Smurfl 
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may regulate ectodermal patterning by modulating BMP signaling. Therefore, Smurfl was 
tested for .its presence in ectodermal tissue. It was found that overexpression of Smurfl in 
animal caps triggers neural and cement gland differentiation, which is characteristic of a 
reduction in BMP signaling. These effects were reversed by co-expression of Smadl in 
5 the animal cap (Fig. 7B). Together these results demonstrate that Smurfl can block BMP 
signals in the ectoderm and mesoderm, suggesting that Smurfl decreases BMP signaling 
in these tissues to affect embryonic patterning. 

Smurfl inhibits Smadl activity but potentiates Smad2 activity in embryonic 
cells. In cultured cells, Smurfl triggers ubiquitination and degradation of BMP 

10 pathway-specific Smads, and in Xenopus embryos Smurfl antagonizes endogenous BMP 
signals. In Xenopus animal caps overexpression of Smads mimics the effects of TGFp 
factors that signal through specific R-Smads. Thus, Smadl induces exclusively 
ventral/posterior mesoderm, like BMP ligands, while Smad2 induces dorsal (Spemann 
organizer) mesoderm, like activin, Vgl and nodal. We therefore tested whether Smurfl 

1 5 can directly antagonize the mesoderm induction activities of Smadl or Smad2 by 

overexpressing each Smad together with various doses of Smurfl in animal caps. We 
found that expression of Smadl alone (1 ng mRNA) induced ventral mesoderm, as 
demonstrated by expression of the ventral/posterior mesodermal markers Xhox3 and 
Xcadl. However, co-expression of Smurfl and Smadl blocked induction of these markers 

20 at all Smurfl doses tested (Fig. 8 A), demonstrating that Smurfl can antagonize Smadl 
activity. 

To determine whether Smurfl can interfere with Smad2, a dose of Smad2 
(50 pg mRNA) was used in an amount sufficient to induce myoD (a vertebrate muscle 
marker of dorsal/lateral mesoderm), but insufficient to induce goosecoid (a frog marker for 

25 dorsal mesoderm of the Spemann Organizer) (Figs. 7B, C). When Smurfl was 

co-expressed with Smad2 there was no inhibition of myoD induction at any of the Smurfl 
doses tested (Fig. 8B). This was consistent with other findings that Smurfl does not target 
Smad2. Interestingly, as the dose of Smurfl was increased in the presence of this limiting 
amount (50 pg) of Smad2, induction of goosecoid gene expression was observed. This 

30 induction was dependent on Smad2 expression, since Smurfl alone did not induce 

goosecoid. The response of cap cells to increasing amounts of Smurfl in the presence of 
limiting Smad2 had similar effects to increasing the dose of Smad2 alone (Fig- 8C). Thus, 
a combination of 50 pg Smad2 and 100 pg Smurfl induced goosecoid expression to a level 
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equivalent to a 5-fold higher (250 pg) dose of Smad2 alone (Fig. 8C). Thus, rather than 
inhibit Smad2 activity, Smurfl appears to enhance the sensitivity of the animal cap to 
Smad2. These experiments demonstrate that in addition to blocking the response of 
animal pole cells to the BMP pathway, Smurfl enhances these cells' response to the 
5 activin pathway. Although the invention is not dependent on any particular mechanism, 
the inventors propose that the Smurfl -mediated shift in responsiveness of animal pole cell 
results from lowered endogenous BMP signaling caused by targeted ubiquitination of the 
Smurfl substrates, Smads 1 and 5. Therefore, Smurfl functions during development to 
alter the competence of cells to respond to multiple TGFp ligands by selective inactivation 
10 of a particular Smad pathway. 

Discussion 

The findings of the inventors provide a clear role, pathway and mechanism 
for selective ubiquitination in regulating developmental patterning. The Example 
presented evidence that TGFp signaling can be controlled by ubiquitination of Smad 

15 signal transduction molecules, and that this activity has developmental consequences. 

Specifically, it was shown that Smurfl, a Hect family E3 ubiquitin ligase, selectively and 
directly interacts with BMP pathway-specific Smads, Smadl and Smad5, triggering their 
ubiquitination and degradation. Smurfl does not, however, interact with or affect 
TGFp/activin pathway-specific Smad2, nor the common partner in Smad signal 

20 transduction complexes, Smad4. Furthermore, targeting of Smads by Hect ubiquitin 
ligases is not a general characteristic of E3 ligases since Nedd4 does not interact with 
Smads. Interestingly, the results showed Smurfl -mediated turnover of Smads is not 
affected by BMP signaling, suggesting that activated forms of Smadl or SmadS are not 
required to serve as Smurfl substrates. Thus, Smurfl does not act downstream of 

25 activated Smads to turn off BMP signals, but rather controls the competence of cells to 
respond to BMPs by regulating the steady state level of Smad protein in the cell. 

The mSmurfl gene is expressed in early development, where we predict it 
will regulate the response of cells to BMP signals. Smurfl is present in adult tissues 
implying mSmurfl regulates late stage BMP signals. Perhaps these signals act in tissue 

30 stasis, or perhaps ongoing differentiation associated with growth. To a large degree the 
expression in developing mouse at neurula stage and thereafter is in good accord with the 
expression patterns seen in the frog embryo at similar stages. 
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The phenotypic effects of Smurfl in Xenopus embryos point to selective 
ubiquitination as an important regulator of inductive signals during embryonic 
development. Smurfl dorsalizes mesoderm and neuralizes ectoderm by interfering with 
BMP signals that control patterning of these germ layers in normal development. 
5 Presently, regulation of BMP-dependent patterning is considered to be accomplished by 
secreted BMP-binding proteins, such as chordin, noggin and follistatin, which inhibit BMP 
signals extracellularly by direct binding to BMP ligands. However, Smurfl provides a 
new mechanism to regulate BMP signals, which acts at the level of signal transduction, 
and presumably functions cell autonomously because Smurfl is an intracellular protein. 

10 Furthermore, unlike secreted BMP binding proteins, which have restricted ligand 

specificity, Smurfl provides broad inhibitory activity for BMP pathways because many 
type I receptors (ALK1, ALK2, ALK3, and ALK6) signal through Smadl and Smad5 (42). 
Thus, in the developing embryo Smurfl cooperates with extracellular BMP inhibitors to 
specify pattern formation. Whether Smurfl is regulated in some manner remains an open 

1 5 question. 

Of particular interest is the localization of Smurfl mRNA to the animal 
pole of the Xenopus egg and blastula. This region forms ectodermal tissues such as 
epidermis, cement gland and the nervous system, tissue fates which are controlled by the 
level of BMP signaling perceived by the cells. When animal pole ectoderm is deprived of 

20 endogenous BMP signals neural differentiation occurs spontaneously, but as the levels of 
BMP ligand applied to cells, or the amount of Smadl expressed within cells, are gradually 
increased, a spectrum of cell fates are progressively specified from neuronal, through 
cement glad then epidermis. Furthermore, reduced BMP signaling in the prospective 
ventral mesoderm of Xenopus embryos causes that tissue to be re-specified as dorsal 

25 mesoderm. Since Smurfl can regulate cell responsiveness to BMP signals it may function 
in both ectodermal and mesodermal fate specificity by controlling the level of Smadl or 
Smad5. High Smurfl levels may completely shut off the ability of a cell to respond to 
BMPs, but at intermediate or low levels Smurfl may modulate the magnitude of the BMP 
signal through the control of Smadl protein levels to alter the nature of the cellular 

30 response. Consequently, Smurfl may control cell fate determination in response to BMPs 
by establishing an intracellular morphogenetic gradient of Smadl activity. In a variety of 
animals maternal mRNAs localized in eggs function as determinants of cell fate (43). In 
Xenopus, mRNAs encoding Vgl (a TGFp member) and Brat/VgT are localized to the egg 
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vegetal pole where they specify endoderm and mesoderm (44, 45). The results presented 
in this Example suggest that localized Smurf 1 in the egg animal pole may function as a 
determinant of ectodermal cell fate. 

An intriguing consequence of Smurf 1 overexpression in Xenopus animal 
5 caps is altered cell competence to respond to Smad2, the activin/TGFp pathway specific 
Smad. Smurf 1 enhances the sensitivity of cells to a fixed level of ectopically expressed 
Smad2, so as Smurfl levels increase, the mesoderm that is induced becomes progressively 
more characteristic of the'Spemann organizer, the most dorsal type of mesoderm. This 
response mimics the effects of what happens when the levels of Smad2 alone are elevated 
10 in animal caps. Thus, Smurfl can simultaneously change the competence of cells to 
respond to different TGFp signals: it inhibits responses to the Smadl pathway while 
stimulating responses to the Smad2 pathway. The distinct effects of Smurfl on different 
TGF|3 signaling pathways has important consequences for embryonic development, cell 
growth, tissue stasis and other biological functions regulated by the TGFP superfamily. 

1 5 EXAMPLE 2: Characterization and Cloning of human Smurfi 

Human Smurfl was identified and cloned using a Xenopus Smurfl 
sequence in an EST database. Two overlapping EST clones corresponding to hSmurf2 
were obtained and used to construct a full length sequence for hSmurf2. Smurf2 is closely 
related to Smurfl, displaying approximately 75% homology to the amino acid sequence of 

20 hSmurf 1 . Smurf2 contains a C2 domain at the amino-terminus, followed by three WW 
domains and a HECT ubiquitin ligase domain (Fig. 12). Smurf2 is closely related to 
Smurfl, but possesses an extra WW domain downstream of the C2 domain (Fig. 13). 
Smurf2 was found to be expressed throughout early development and was present in most 
adult tissues, with lower levels in spleen and skeletal muscle (Figs.l4A and 14B). RT- 

25 PCR analysis further revealed that Smurf2 is expressed in a variety of cell lines that 
include PI 9, HepG2 and 293T. 

Isolation of Human and mouse Smurfl, Several overlapping human clones 
displaying similarity to Smurfl were identified from the expressed sequence tag (EST) 
database and a full-length version of Smurf2 was constructed by PCR using two 

30 overlapping EST clones. For northern blot analysis, a partial mouse Smurf2 cDNA clone 
.encoding 225 amino acids of open reading frame including the stop codon and displaying 
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96% amino acid identity to human Smurf2 was identified in the EST database (I.M.A.G.E. 
clone ID 638876). 

Construction of Plasmids. For mammalian expression constructs of 
Smurf2, the open reading frame was amplified by polymerase chain reaction (PCR) and 
5 was subcloned into pCMV5 in frame with an amino-terminal Flag or Myc tag (69). For 
Smurf2 WW domain deletions, amino acids 163-185 for DWW1, 257-279 for DWW2, 
and 303-325 for DWW3 were deleted. To generate the catalytically-inactive ubiquitin- 
ligase mutant of Smurf2, cysteine 716 was replaced with alanine. To generate the Smad7 
PY mutants, tyrosine 21 1 was replaced with alanine (Y21 1A) or the PPPPY sequence 

10 between amino acid residues 206-212 was deleted (APY). For TpRI-Flag, a Flag tag was 
introduced at the carboxy terminus of the receptor. All constructs were generated by PCR 
and confirmed by sequencing. The bacterial expression vectors, pET15-Smad7-HA and 
pGEX4T-l-Smurf2, were generated using convenient restriction sites. 

Immunoprecipitation, Immunoblotting, and Affinity-labelling. For studies 

15 in mammalian cells, 293T and COS-1 cells were transiently transfected using calcium 

phosphate precipitation, or the DEAE-dextran method, respectively. Immunoprecipitation 
and immunoblotting were carried out using anti-HA monoclonal (12CA5, Boehringer), 
anti-HA rabbit polyclonal (Santa Cruz), anti-Myc monoclonal (9E10 ascites, 
Developmental Studies Hybridoma Bank), anti-Flag M2 monoclonal (Sigma) or anti- 

20 Smad7 rabbit polyclonal antibodies. For anti-Smad7 antibodies, rabbits were immunized 
with bacterially-produced GST-Smad7 encoding amino acids 202-260. After absorption 
of the antibody to either protein G or A-Sepharose, the precipitates were washed five 
times with TNTE 0.1% (50 mM Tris, pH 7.4, 150 mM NaCl, 1 mM EDTA, 0.1% Triton 
X-100), separated by SDS-PAGE, transferred to nitrocellulose and immunoblotted with 

25 the appropriate antibody. Detection was conducted using the appropriate horseradish 
peroxidase (HRP)-conjugated sheep anti-mouse or anti-rabbit secondary antibodies and 
enhanced chemiluminescence (Amersham). Bacterially-produced His-Smad7-HA was 
incubated with either Ni 2+ -NTA beads (Qiagen) or with GST or GST-Smurf2-bound 
glutathione beads (Amersham), washed three times with TNTE (0.5% Triton X-100) and 

30 precipitates were analyzed by immunoblotting with anti-HA antibodies. For affinity- 
labelling, transfected COS-1 cells were incubated with 250 pM [ 125 I]TGF-bl at 4 °C for 1 
h, and receptors were cross-linked to ligand with DSS as described (70). The amount of 
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TpRI bound to Smurf2 or Smad7 was quantified by phosphorimaging (Molecular 
Dynamics). 

Pulse-chase analysis and ubiquitination assay. COS-1 cells were 
transfected with the indicated constructs and two days post-transfection the cells were 
5 labelled for 15 min at 37 °C with 50 mCi/ml [ 35 S]-methionine (Trans [ 35 S]-label; Figure 
2B) or 150 mCi/ml [ 35 S]-methionine (Figure 5C and D) in methionine-free Dulbecco's 
Modified Eagle's Medium (DMEM). Cell layers were then washed two times and 
incubated in DMEM containing 10% fetal bovine serum in the presence or absence of 
30 mM lactacystin (obtained from E.J. Corey, Harvard University) or 0.4 mM chloroquine 

10 for the indicated times. At each time point, cell lysates were immunoprecipitated with 
anti-HA monoclonal antibodies, resolved by SDS-PAGE and visualized by 
autoradiography. Metabolically-labelled proteins were quantified by phosphorimaging. 
For the ubiquitination assay, 293T cells were transfected with HA-tagged ubiquitin and 
combinations of receptors, Smad7 and Smurf2. Cell lysates were subjected to anti-Smad7 

15 immunoprecipitation and immunoprecipitates were boiled in 1% SDS for 5 min, diluted 
with TNTE 0.1% and reprecipitated with anti-Smad7 antibodies before anti-HA 
immunoblotting. 

Subcellular localization by Immunofluorescence Deconvolution 
Microscopy, MvlLu cells, plated on gelatin-coated Permanox chamber slides (Nunc), 

20 were transfected with the indicated constructs by the calcium phosphate precipitation 

method. The cells were fixed, permeabilized, and reacted with the primary and secondary 
antibodies as described (69). Images were obtained using the Olympus 1X70 inverted 
microscope equipped with fluorescence optics and Deltavision deconvolution microscopy 
software (Applied Precision). 

25 Transcriptional Response Assay. HepG2 cells were transiently transfected 

using the calcium phosphate DNA precipitation method with CMV-Pgal, 3TP-Lux 
reporter construct and Smad7-HA constructs as indicated. Total DNA was kept constant 
by the addition of pCMV5 empty vector. The next day, cells were incubated overnight 
with or without 100 pM TGFp. Luciferase activity was measured using the luciferase 

30 assay system (Promega) in a Berthold Lumat LB 96V luminometer and was normalized to 
p-galactosidase activity. 
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Results 

Smurf2 does not regulate Smad steady-state levels. Smads were expressed 
in 293T cells in the absence or presence of Smurf2. Surprisingly, unlike Smurfl which 
targets Smadl for ubiquitin-mediated proteolysis, Smurf2 expression did not alter the 
5 steady-state levels of Smads 1, 2, 4 or 7 (Fig. 15 A). To determine whether Smurf2 might 
associate with any of these Smads, the cell lysates were subjected to anti-Smurf2 
immunoprecipitation and associated Smads examined by immunoblotting. In cells 
coexpressing Smadl, 2, or 4, no Smads were found to coprecipitate with Smurf2 under 
these conditions (Fig. 15B). In contrast, in cells expressing Smad7, a strong interaction 

10 between Smurf2 and Smad7 was detected. To confirm that Smurf2 expression did not 
alter Smad7 turnover, we also performed pulse-chase analysis of Smad7. Smad7 
expressed in the absence or presence of Smurf2 displayed a similar half-life of 
approximately 4 h (Fig. 15C). To characterize the Smad7-Smurf2 association, GST- 
SmurO fusion protein was purified from bacteria and incubated with bacterially produced 

15 Smad7. Under these in vitro conditions Smad7 bound efficiently to Smurf2, indicating 

that Smurf2 associates directly with Smad7 (Fig. 15D). We also analyzed the determinants 
on Smad7 and Smurf2 that mediate their interaction. Smad7 possesses a PPXY sequence 
(PY motif) in its linker region, which can mediate interaction with WW domains such as 
those found in Smurf2 (71). The Smad7 PY motif was altered by changing the tyrosine 

20 residue to alanine (Smad7(Y21 1 A)) or by deleting the entire motif (Smad7(APY)). 
Analysis in 293T cells of Smurf2 binding to either Smad7 mutant revealed that the 
interaction with SmurO was reduced but not entirely abolished (Fig. 15E). This suggests 
that the P Y motif makes an important contribution to the Smad7-Smurf2 interaction, 
however, it is not the sole determinant, and other regions in Smad7 may also contribute to 

25 their association. We also made mutants of Smurf2 in which each of the three WW 

domains were deleted. The interaction of Smad7 with a Smurf2 mutant that lacked the 
first WW domain was comparable to wild type Smurf2, however deletion of either WW2 
or WW3 abolished any detectable interaction with Smad7 (Fig. 15F). Thus, the WW2 and 
WW3 domains in Smurf2 are both required to mediate binding to Smad7. Together, these 

30 results show that Smurf2 binds directly to Smad7 via its WW2 and WW3 domains but that 
it does not target Smad7 for ubiquitin-mediated proteolysis in the absence of TGFp 
signalling. 
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Smadl recruits Smurfi into a complex with the TGF/3 receptors. Smad7 
binds heteromeric complexes of TGFp type II (TBRII) and type I (TBRI) receptors through 
interactions with the activated type I receptor subunit (72, 56). The constitutive 
association between Smad7 and Smurf2 thus raised the interesting possibility that Smad7 
5 might function to recruit Smurf2 to the TGFp receptor complex. To test this, we 
expressed TGFP receptors in COS-1 cells in the presence and absence of Smad7 and 
Smurf2. Receptors were then labelled by crosslinking to [ 125 I]-TGFp. Affinity-labelled 
receptor complexes that co-precipitated with Smurf2 were visualized by autoradiography. 
In the absence or presence of Smad7, little or no TGFp receptor complexes were found to 

10 co-precipitate with wild type Smurf2 (Fig. 16). A Smurf2 catalytic mutant was 

constructed in Smur£2 in which cysteine 716 was converted to an alanine residue (C716A). 
This mutation targeted the cysteine residue in the HECT ubiquitin-ligase domain that is 
thought to form a thiolester bond with ubiquitin. When Smurf2(C716A) was expressed 
alone we detected a slight interaction with the TGFp receptors (Fig. 16). However in the 

1 5 presence of Smad7 the interaction of Smurf2(C71 6 A) with the receptors was dramatically 
enhanced. Thus, Smad7 mediates the interaction of Smurf2 with the TGFp receptors. 

We also examined Smad7 bound to the receptors. Smad7 binds 
heteromeric TGFp receptor complexes by interacting with the activated type I receptor 
subunit (72, 56). In the presence of wild type Smurf2 we observed a strong decrease in the 

20 amount of TGFp receptor complexes that co-precipitated with Smad7 (Fig. 16, lane 3). 
This correlated with a decrease in the total amount of type I receptor present in these 
transfectants. Since Smad7 binds to receptor complexes via interactions with the activated 
type I receptor, these results suggest that Smurf2 decreases the levels of Smad7-bound 
receptor complexes. Consistent with this notion, when Smad7 was co-expressed with the 

25 catalytically inactive mutant of Smurf2 5 a decrease in Smad7-bound receptor levels was 
not observed (Fig. 16, lane 5). These results indicate that the catalytic activity of Smurf2 
mediates the down-regulation of TGFp receptors that are bound to Smad7. 

Smad7 controls the subcellular localization of Smurfi. Smad7 resides in 
the nucleus and upon activation of TGFp signalling is exported into the cytoplasm where it 
30 binds to TGFp receptors through interaction with the receptor I subunit (73). To 

investigate whether Smad7 recruited Smurf2 to the receptors in intact cells, we determined 
whether Smad7 might regulate Smurf2 localization. For this, we expressed TBRII, TBRI 
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and Smurf2(C716A) in the absence or presence of Smad7 and examined the subcellular 
distribution of the appropriate protein by immunofluorescence microscopy. 

Smad7 was localized to the nucleus, but in the presence of signalling was 
found predominantly in the cytoplasm, where it also colocalized with the transiently 
5 expressed TGFp receptors in a punctate pattern. This distribution of TGFp receptors has 
been observed previously by us and others (74-76). Smurf2 was similarly found 
predominantly in the nucleus, but this localization was not altered in the presence of TGFp 
signalling. However, when Smad7 was coexpressed, the subcellular distribution of 
Smurf2 was dramatically altered and the protein was found predominantly outside of the 
10 nucleus and was extensively colocalized with the TGFp receptors. These results suggest 
that Smad7 expression leads to export of Smurf2 from the nucleus and recruitment of 
Smurf2 to the TGFp receptor complex. 

Smurfl induces degradation ofTGFfi receptors and Smad7. The strong 
decrease in Smad7-associated receptors in the presence of wild type but not mutant 

1 5 Smurf2 suggested the possibility that Smurf2 might catalyze degradation of Smad7-bound 
receptor complexes. To analyze Smurf2-dependent turnover of TGFp receptor complexes 
we coexpressed TBRII and TBRI in 293T cells. Under these conditions TBRII and TBRI 
assemble efficiently into functional heteromeric complexes, allowing us to investigate the 
turnover of the entire receptor pool. We first investigated steady-state levels of the 

20 receptors in cells expressing Smurf2 (Fig. 17 A) and observed that Smurf2 had minimal 
effects on the steady-state levels of type II or type I receptors when the receptors were 
expressed either alone or together. However, in the presence of wild type Smad7, 
increasing Smurf2 expression led to a strong decrease in the steady- state levels of the type 
I receptor (Fig. 17B). In contrast, Smurf2(C716A) had no effect. Levels of type II 

25 receptor were also decreased upon expression of wild type Smurf2, although to a lesser 
extent than that observed for the type I receptor. This difference may be due to the fact 
that Smad7 binds receptor complexes via the activated type I receptor and does not interact 
with TBRII alone. We also tested whether Smurf2 targets a constitutively active version of 
the type I receptor, T6RI(T204D). This receptor mediates TGFP signalling in the absence 

30 of the type II receptor and also binds Smad7 (72). Similar to receptor complexes, wild 
type Smurf2, but not Smurf2(C716A) caused a decrease in the steady-state levels of the 
activated type I receptor (Fig. 17B). To confirm that changes in receptor steady-state 
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levels reflected alterations in receptor turnover, we analyzed by pulse-chase analysis the 
half-life of TBRII and TBRI (Fig. 17C). Analysis of the type II receptor revealed that the 
newly synthesized protein had a half-life of approximately 1 h. In contrast TBRI was 
considerably more stable and displayed a half-life of approximately 4-6 h. Furthermore, 
5 the half-life of the type I receptor was unchanged when either Smad7 or Smurf2 were 
expressed individually with the receptors. However, when Smurf2 and Smad7 were co- 
expressed together with the TGFp receptor complex, the half-life of the type I receptor was 
decreased to approximately 1 h. Thus, Smad7 and Smurf2 enhance the turnover of the 
type I receptor. 

10 Ubiquitin-mediated proteolysis of membrane receptors can be mediated by 

both the proteasome and lysosome (60). To test whether Smurf2-dependent enhancement 
of TGFP receptor degradation was dependent on proteasome and lysosome function we 
assessed the turnover of receptors in the presence and absence of lactacystin and 
chloroquine, which inhibit protein degradation by the proteasome and lysosome, 
1 5 respectively. Pulse-chase analysis of receptors revealed that each inhibitor on its own 

caused stabilization of a subset of the total TGFp receptor pool (Fig. 17D). These results 
suggest that both the proteasome and the lysosome contribute to the enhanced turnover of 
the receptors that is observed in the presence of Smad7 and Smurf2. 

In the course of these analyses of the TGFp receptor we also evaluated 
20 Smad7 protein levels. In the absence of TGFp receptor complexes Smad7 steady-state 
level and turnover was unaffected by Smurf2 (see Fig. 15). However, in the presence of 
TGFp receptor complexes, Smad7 steady-state levels and half-life were decreased by 
Smurf2 expression (Figs. 17B and 17C, respectively). Furthermore, this decrease in 
Smad7 was dependent on the catalytic activity of the Smurf2 HECT domain, since 
25 expression of the Smurf2(C716A) mutant did not alter Smad7 levels. Smad7 turnover was 
also stabilized by lactacystin and chloroquine, suggesting that like the receptor complex, 
Smad7 is degraded by both proteasomal and lysosomal pathways. Thus, in the presence of 
TGFp signalling Smurf2 induces degradation of Smad7, possibly by targeting the entire 
receptor-Smad7 complex. 

30 To investigate ubiquitination of the receptors and Smad7, we expressed an 

HA epitope-tagged version of ubiquitin and evaluated ubiquitin-conjugates of TBRII, TBRI 
or Smad7 by immunoprecipitation followed by immunoblotting. To ensure specificity, 
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immunoprecipitated samples were boiled in SDS and then reprecipitated with the 
appropriate antibody prior to immunoblotting. Analysis of Smad7 ubiquitination revealed 
that in the absence or presence of Smurf2 there was little ubiquitin conjugated to the 
protein (Fig. 17E). However, when SmurG was coexpressed with Smad7 and the TGFp 
5 receptors, we observed high molecular weight ubiquitin conjugates of Smad7 that were not 
detected when the catalytic mutant of SmurG was used. In contrast to Smad7, we were 
unable to detect significant SmurG-dependent ubiquitination of the type I receptor. Since 
receptor turnover is increased in the presence of Smurf2 and Smad7, the inability to detect 
ubiquit in-conjugated receptors may reflect rapid degradation of the receptors that may 
1 0 occur once they are ubiquitinated. Alternatively, ubiquitination of Smad7 may serve as the 
signal that targets the entire receptor-Smad7 complex to the proteasome. Together these 
results show that Smad7-dependent recruitment of Smurf2 to the TGFp receptor leads to 
proteasome and lysosome-mediated degradation of TGFp receptor complexes and Smad7. 

Association of Smurf2 enhances the inhibitory activity ofSmad7. Our 

1 5 studies indicate that SmurG may be recruited to the TGFp receptor complex via Smad7 

and target the receptors for degradation. This suggests that ubiquitin-mediated degradation 
may contribute to the inhibitory activity of Smad7 in the TGFp pathway. To test this we 
first investigated whether Smad7(Y21 1A), which displays poor interactions with SmurG 
(see Fig. 15D), has an altered ability to recruit SmurG to the TGFp receptor. For this, 

20 wild type Smad7 or Smad7(Y21 1 A) were coexpressed with TGFp receptors in the 
presence or absence of SmurG(C716A) and receptor interaction was measured by 
analyzing affinity labelled receptors that coprecipitated with either Smad7 or SmurG. The 
interaction of TGFp receptor complexes with Smad7(Y21 1 A) was comparable to that 
observed with wild type Smad7 (Fig. 18A), demonstrating that this mutation in the PY 

25 motif does not affect Smad7 interaction with the TGFP receptor. However, the ability of 
SmurG(C716A) to associate with TGFp receptors was substantially reduced in the 
presence of mutant Smad7, correlating with the reduced efficiency observed for the 
interaction between SmurG and Smad7(Y21 1 A) described above. Next we investigated 
whether Smad7(Y21 1 A) had an altered capacity to inhibit TGFp signalling in HepG2 

30 cells, which express endogenous SmurG. For this, we assessed TGFp signalling using the 
well characterized 3 TP-lux reporter construct. Expression of wild type Smad7 strongly 
reduced TGFb-dependent induction of this reporter (Fig. 18B). In contrast, the 
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Smad7(Y21 1 A) mutant had a substantially reduced inhibitory activity, despite its ability to 
interact efficiently with the TGFp receptor complex. This suggests that binding of Smurf2 
to Smad7 enhances Smad7 inhibitory activity towards the TGFp signalling pathway. 

Previous studies have shown that Smad7 binding to TGFp receptor 
5 complexes prevents access of Smad2 to the receptors. Since Smad7(Y21 1 A) interacts 
efficiently with the TGFp receptor, we sought to determine whether the mutant protein 
might retain inhibitory activity to the TGFp pathway at higher levels of expression. To 
examine this, we compared the relative efficiency of Smad7 versus Smad7(Y21 1 A) in 
blocking TGFP signalling by varying the amount of Smad7 expression. Wild type Smad7 

10 potently inhibited TGFp signalling at the lowest doses tested, whereas Smad7(Y21 1A) 
was much less efficient (Fig. 18C). However, at the highest dose tested, Smad7(Y21 1 A) 
was capable of inhibiting TGFP responsiveness. These results indicate that 
Smad7(Y21 1 A) retains some inhibitory activity, probably through its ability to bind to the 
receptor and prevent access of Smad2 or Smad3. Together these data suggest that Smurf2 

15 cooperates with Smad7 to promote its inhibitory activity. 

Discussion 

This Example confirms the identity of a new ubiquitin ligase, Smurf2, that 
functions in partnership with Smad7 to target the TGFp receptor for degradation. Smurf2 
bound TGFP receptors very weakly and was unable to affect turnover of the receptors. 

20 However, in presence of Smad7, Smurf2 formed efficient complexes with the heteromeric 
TGFP receptor and targeted the receptors for degradation. Since Smad7 binds directly to 
Smurf2 and also associates with the TGFp receptor complex, these results indicated that 
Smad7 functions as an adaptor protein that mediates the interaction of Smurf2 with the 
TGFp receptor complex (Fig. 18D). Consistent with this, mutations in the Smad7 PY 

25 motif that disrupted Smad7-Smurf2 interaction also interfered with the ability of Smad7 to 
recruit Smurf2 into a complex with the receptor. This mutant of Smad7 was also 
compromised in its ability to block TGFp signalling, indicating that Smurf2 plays a role in 
mediating Smad7 inhibitory function. Since Smad7 competes with R-Smads for binding 
to the activated TGFp receptor, this cooperation may be particularly important when 

30 Smad7 is expressed transiently or at low levels (53-58). By targeting receptors for 
degradation, Smurf2 can thus provide a mechanism for permanent inactivation of the 
Smad7-bound receptor complex. Concomitant with Smurf2-dependent degradation of 
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TGFp receptors, we also observed degradation of Smad7 that was dependent on TGFP 
signalling. 

* * * 

The present invention is not to be limited in scope by the specific 
5 embodiments described herein. Indeed, various modifications of the invention in addition 
to those described herein will become apparent to those skilled in the art from the 
foregoing description and the accompanying figures. Such modifications are intended to 
fall within the scope of the appended claims. 

It is further to be understood that all base sizes or amino acid sizes, and all 
10 molecular weight or molecular mass values are approximate, and are provided for 
description. 

All patents, patent applications, publications, and other materials cited 
herein are hereby incorporated herein reference in their entireties. 
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WHAT IS CLAIMED IS : 



1 1 . An isolated Smurf protein. 

1 2. The Smurf protein of claim 1, which is human. 

1 3. The Smurf protein of claim 1 , which is a Smurfl . 

1 4. The Smurfl of claim 3 which has a mutation corresponding to 

2 C710A. 

1 5. The Smurfl of claim 3, which comprises at least 10 contiguous 

2 amino acid residues as depicted in a sequence of SEQ ID NO:2. 

1 6. The Smurfl of claim 3, which comprises an amino acid sequence as 

2 depicted in SEQ ID NO:2. 

1 7. The Smurf protein of claim 1, which is a Smurf2. 

8. The Smurf2 of claim 7 which has a mutation corresponding to 

C716A. 

1 9. The Smurf2 protein of claim 7, which comprises at least 10 

2 contiguous amino acid residues as depicted in SEQ ID NO:4. 

1 10. The Smurf2 protein of claim 7, which comprises an amino acid 

2 sequence as depicted in SEQ ID NO:4. 

1 11. An isolated nucleic acid encoding the Smurf protein of claim 1 . 

1 12. The nucleic acid of claim 1 1 , wherein the Smurf protein is a human 

2 Smurf protein. 
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13. 



The nucleic acid of claim 1 1, wherein the Smurf protein is a 



Smurfl. 



14. 



The nucleic acid of claim 1 3 which has a mutation corresponding to 



C710A. 



15. The nucleic acid of claim 13, wherein the Smurfl comprises at least 



10 contiguous amino acid residues as depicted in a sequence of SEQ ID NO:2. 

16. The nucleic acid of claim 13, wherein the Smurfl comprises an 
amino acid sequence as depicted in SEQ ID NO:2. 

17. The nucleic acid of claim 16, which has a nucleotide sequence as 
depicted in SEQ ID NO:l. 

1 8. The nucleic acid of claim 1 1 , wherein the Smurf protein is a 

Smurf2. 

19. The nucleic acid of claim 18 which has a mutation corresponding to 

C716A. 

20. The nucleic acid of claim 1 8, wherein the Smurf2 protein comprises 
at least 10 contiguous amino acid residues as depicted in SEQ ID NO:4. 

21 . The nucleic acid of claim 1 8, wherein the Smurf2 protein comprises 
an amino acid sequence as depicted in SEQ ID NO:4. 

22. The nucleic acid of claim 18, which has a nucleotide sequence as 
depicted in SEQ ID NO:3. 



23. 



A vector comprising the nucleic acid of claim 1 1 . 



24. 



A host cell comprising the vector of claim 23. 
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1 25. A method for producing a Smurf protein, comprising growing the 

2 host cell of claim 23 under conditions that permit expression of Smurf protein from the 

3 vector. 

26. A method for producing a Smurf protein, comprising growing the 
host cell of claim 24 under conditions that permit expression of Smurf protein from the 
vector. 



1 27. A transgenic non-human animal that expresses a human Smurf 

2 protein. 

1 28. A method for inhibiting a bone morphogenic protein or tumor 

2 growth factor-beta activation pathway in a cell, which method comprises permitting the 

3 cell to grow under conditions that permit expression of Smurf from the vector of claim 23 

4 introduced into the cell. 



29. A method for inhibiting a bone morphogenic protein or tumor 
growth factor-beta activation pathway in a cell, which method comprises permitting the 
cell to grow under conditions that permit expression of Smurf from the vector of claim 24 
introduced into the cell. 



1 30. A method for promoting a bone morphogenic protein or tumor 

2 growth factor-beta activation pathway in a cell, which method comprises suppressing 

3 expression of endogenous Smurf in the cell. 

1 3 1 . A method of screening for a modulator of Smurf activity, which 

2 method comprises detecting modulation of Smurf activity in the presence of a test 

3 compound relative to Smurf activity in the absence of the test compound. 

1 32. The method according to claim 3 1 , wherein the Smurf activity is 

2 ubiquitination of a Smad protein in a host cell. 
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1 33. The method according to claim 3 1 , wherein the Smurf activity is 

2 interaction of a Smurf WW domain with a PPXY domain of a Smad protein. 

1 34. The method according to claim 33, wherein the test compound is 

2 screened for the ability to inhibit the interaction. 

1 35. An antibody that specifically binds to Smurf protein. 

1 36. An oligonucleotide or nucleic acid that specifically hybridizes under 

2 highly stringent conditions to an nucleic acid having a sequence encoding Smurf. 
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ABSTRACT OF THE INVENTION 



This invention provides unique members of the Hect family of ubiquitin 
ligases that specifically target BMP and TGFp/activin pathway-specific Smads. The novel 
ligases have been named Smurfland Smurf2. They directly interact with Smadsl and 5 
and Smad7, respectively, and regulate the ubiquitination, turnover and activity of Smads 
and other proteins of these pathways. Smurfl interferes with biological responses to 
BMP, but not activin signaling. In amphibian embryos Smurfl inhibits endogenous BMP 
signals, resulting in altered pattern formation and cell fate specification in the mesoderm 
and ectoderm. The present invention provides a unique regulatory link between the 
ubiquitination pathway and the control of cell fate determination by the TGFp superfamily 
during embryonic development. Thus, Smurfl is a negative regulator of Smadl signal 
transduction, by targeting Smadl, Smurfl blocks BMP signaling. In mammalian cells, 
Smurf2 suppresses TGFp signalling, and in Xenopus, blocks formation of dorsal 
mesoderm and causes anterior truncation of the embryos. Smurf2 forms a stable complex 
with Smad7, which induces degradation and downregulation of TGFp/activin signalling. 
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FIG. 2A 
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FIG. 5C 
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FIG. 9A 



10 20 30 40 50 

***** 

GGAGGCTCCA GCATCAAGAT CCGTCTGACA GTGTTATGTG CCAAGAACCT 

60 70 80 90 100 

***** 

TGCAAAGAAA GACTTCTTCA GGCTCCCTGA CCCTTTTGCA AAGATTGTCG 

110 120 130 140 150 

***** 

TGGATGGGTC TGGGCAGTGC CACTCAACCG ACACTGTGAA AAACACATTG 

160 170 180 190 200 

***** 

GACCCAAAGT GGAACCAGCA CTATGATCTA TATGTTGGGA AAACGGATTC 

210 220 230 240 250 

***** 

GATAACCATT AGCGTGTGGA ACCATAAGAA AATTCACAAG AAACAGGGAG 

260 270 280 290 300 

***** 

CTGGCTTCCT GGGCTGTGTG CGGCTGCTCT CCAATGCCAT CAGCAGATTA 

310 320 330 340 350 

***** 

AAAGATACCG GATACCAGCG TTTGGATCTA TGCAAACTAA ACCCCTCAGA 

360 370 380 390 400 

***** 

TACTGATGCA GTTCGTGGCC AGATAGTGGT CAGTTTACAG ACACGAGACA 
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FIG. 9B 
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660 670 680 690 700 
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GAACCGACCA CACGGCCACC AGTCCCCGGA ACTGCCCGAA GGCTACGAAC 
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FIG. 9C 

B b 

***** 

AAAGAACAAC AGTCCAGGGC CAAGTTTACT TTTTGC AT AC ACAGACTGGA 
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***** 

GTTAGCACGT GGCACGACCC CAGGATACCA AGAGACCTTA ACAGTGTGAA 

810 820 830 840 850 

***** 

CTGTGATGAA CTTGGACCAC TGCCGCCAGG CTGGGAAGTC AGAAGTACAG 

860 870 880 890 900 

***** 

TTTCTGGGAG GATATATTTT GTAGATCATA ATAACCGAAC AACCCAGTTT 

910 920 930 940 950 

***** 

ACAGACCCAA GGTTACACCA CATCATGAAT CACCAGTGCC AACTCAAGGA 

960 970 980 990 1000 

***** 

GCCCAGCCAG CCGCTGCCAC TGCCCAGTGA GGGCTCTCTG GAGGACGAGG 

1010 1020 1030 1040 1050 

***** 

AGCTTCCTGC CCAGAGATAC GAAAGAGATC TAGTCCAGAA GCTGAAAGTC 

1060 1070 1080 1090 1100 

* * * * * 

CTCAGACACG AACTGTCGCT TCAGCAGCCC CAAGCTGGTC ATTGCCGCAT 
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FIG. 9D 
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AGATGCGACC GAAAGACTTG AAAAAACGGC TGATGGTGAA ATTCCGTGGG 

1210 1220 1230 1240 1250 

***** 

GAAGAAGGTT TGGATTACGG TGGTGTGGCC AGGGAGTGGC TTTACTTGCT 

1260 1270 1280 1290 1300 

***** 

GTGCCATGAA ATGCTGAATC CTTATTACGG GCTCTTCCAG TATTCTACGG 

1310 1320 1330 1340 1350 

***** 

ACAATATTTA CATGTTGCAA ATAAATCCGG ATTCTTCAAT CAACCCCGAC 

1360 1370 . 1380 1390 1400 

***** 

CACTTGTCTT ATTTCCACTT TGTGGGGCGG ATCATGGGGC TGGCTGTGTT 
1410 1420 1430 1440 1450 
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FIG. 9E 

D D 

CCATGGACAC TACATCAACG GGGGCTTCAC AGTGCCCTTC TACAAGCAGC 
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TGCTGGGGAA GCCCATCCAG CTCTCAGATC TGGAATCTGT GGACCCAGAG 
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***** 
CTGCATAAGA GCTTGGTGTG GATCCTAGAG AACGACATCA CGCCTGTACT 

1560 1570 1580 1590 1600 

***** 

GGACCACACC TTCTGCGTGG AACACAACGC CTTCGGGCGG ATCCTGCAGC 

1610 1620 1630 1640 1650 

***** 

ATGAACTGAA ACC C AATGGC AGAAATGTGC CAGTCACAGA GGAGAATAAG 

1660 1670 1680 1690 1700 

***** 

AAAGAATACG TCCGGTTGTA TGTAAACTGG AGGTTTATGA GAGGAATCGA 

1710 1720 1730 1740 1750 

***** 

AGCCCAGTTC TTAGCTCTGC AGAAGGGGTT CAATGAGCTC ATCCCTCAAC 

1760 1770 1780 1790 1800 

* * * * * 

ATCTGCTGAA GCCTTTTGAC CAGAAGGAAC TGGAGCTGAT CATAGGCGGC 
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FIG. 9F 
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1910 1920 1930 1940 1950 
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* , * * * * 
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FIG. I6B 
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FIG. I7CI 
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